Abstract The classical Lojasiewicz inequality and its extensions for partial differential
equation problems (Simon) and to o-minimal structures (Kurdyka) have a considerable impact on the
analysis of gradient-like methods and related problems: minimization methods, complexity
theory, asymptotic analysis of dissipative partial differential equations, tame geometry. This paper
provides alternative characterizations of this type of inequality for nonsmooth lower
semicontinuous functions defined on a metric or a real Hilbert space. In a metric context, we show that a
generalized form of the Lojasiewicz inequality (hereby called the Kurdyka-Lojasiewicz
inequality) relates to metric regularity and to the Lipschitz continuity of the sublevel mapping, yielding
applications to discrete methods (strong convergence of the proximal algorithm). In a Hilbert
setting we further establish that asymptotic properties of the semiflow generated by $-\partial f$ are
strongly linked to this inequality. This is done by introducing the notion of a piecewise
subgradient curve: such curves have uniformly bounded lengths if and only if the Kurdyka-Lojasiewicz
inequality is satisfied. Further characterizations in terms of talweg lines (a concept linked to the
location of the least steep points of the level sets of $f$) and integrability conditions are given.
In the convex case these results are significantly reinforced, allowing us in particular to establish
the asymptotic equivalence of discrete gradient methods and continuous gradient curves. On
the other hand, a counterexample of a convex $C^2$ function in $\mathbb{R}^2$ is constructed to illustrate the
fact that, contrary to our intuition, and unless a specific growth condition is satisfied, convex
functions may fail to fulfill the Kurdyka-Lojasiewicz inequality.
1 Introduction



The Lojasiewicz inequality is a powerful tool to analyze convergence of gradient-like methods
and related problems. Roughly speaking, this inequality is satisfied by a $C^1$ function $f$ if, for
some $\theta \in [\frac{1}{2}, 1)$, the quantity

$$|f - f(\bar x)|^{\theta}\,\|\nabla f\|^{-1}$$

remains bounded around any (possibly critical) point $\bar x$. This result is named
after S. Lojasiewicz [33], who was the first to establish its validity for the classes of real-analytic
and $C^1$ subanalytic functions. At the same time, it has been known that the Lojasiewicz
inequality fails for $C^\infty$ functions in general (see the classical example of the function
$x \mapsto \exp(-1/x^2)$ if $x \neq 0$, and $0$ if $x = 0$, around the point $x = 0$).
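The contrast between an analytic function and the flat function above can be seen numerically. The following is only an illustrative sketch: the helper name `lojasiewicz_ratio`, the sample points and the exponent $\theta = 0.9$ are our own choices, and the critical value is assumed to be $0$. It evaluates the ratio $|f|^{\theta}/|f'|$, which stays bounded near the critical point exactly when $|f'| \ge c\,|f|^{\theta}$ for some $c > 0$.

```python
import math

def lojasiewicz_ratio(f, df, x, theta):
    """Return |f(x)|**theta / |f'(x)| (critical value assumed to be 0)."""
    return abs(f(x)) ** theta / abs(df(x))

# Analytic example: f(x) = x^2 satisfies the inequality with theta = 1/2.
quad = lambda x: x * x
dquad = lambda x: 2.0 * x

# Flat example from the text: f(x) = exp(-1/x^2) for x != 0.
flat = lambda x: math.exp(-1.0 / x**2)
dflat = lambda x: 2.0 / x**3 * math.exp(-1.0 / x**2)

points = [0.2, 0.1, 0.05]  # hypothetical sample points approaching 0
quad_ratios = [lojasiewicz_ratio(quad, dquad, x, 0.5) for x in points]
flat_ratios = [lojasiewicz_ratio(flat, dflat, x, 0.9) for x in points]

print("x^2         :", quad_ratios)   # stays constant
print("exp(-1/x^2) :", flat_ratios)   # grows without bound as x -> 0
```

For the flat function the ratio equals $\frac{x^3}{2}\exp((1-\theta)/x^2)$, so it blows up for every $\theta < 1$; no exponent can rescue the inequality.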

A generalized form of this inequality has been introduced by K. Kurdyka in [29]. In the
framework of a $C^1$ function $f$ defined on a real Hilbert space $[H, \langle\cdot,\cdot\rangle]$, and assuming for simplicity
that $\bar f = 0$ is a critical value, this generalized inequality (that we hereby call the
Kurdyka-Lojasiewicz inequality, or in short, the KL-inequality) states that

$$\|\nabla(\varphi \circ f)(x)\| \ge 1, \tag{1}$$

for some continuous function $\varphi : [0, \bar r) \to \mathbb{R}$, $C^1$ on $(0, \bar r)$ with $\varphi' > 0$, and all $x$ in $[0 < f < \bar r] :=
\{y \in H : 0 < f(y) < \bar r\}$. The class of such functions $\varphi$ will be further denoted by $\mathcal{K}(0, \bar r)$, see
(8). Note that the Lojasiewicz inequality corresponds to the case $\varphi(t) = t^{1-\theta}$.
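As a small sanity check of the desingularizing effect of $\varphi$, here is a sketch under our own assumptions: the test function $f(x) = x^4$, for which $|f'| = 4 f^{3/4}$ so that $\theta = 3/4$, and the helper `numerical_derivative` are illustrative and do not come from the paper. With $\varphi(t) = t^{1-\theta} = t^{1/4}$ one gets $\varphi \circ f = |x|$, whose derivative has modulus $1$ away from the critical point, even though $f'$ itself vanishes to third order there.

```python
def numerical_derivative(g, x, h=1e-6):
    """Central finite-difference approximation of g'(x)."""
    return (g(x + h) - g(x - h)) / (2.0 * h)

f = lambda x: x**4           # critical value 0 at x = 0, theta = 3/4
phi = lambda t: t ** 0.25    # phi(t) = t^(1 - theta)
phi_of_f = lambda x: phi(f(x))

samples = [0.5, 0.1, 0.01, -0.01, -0.3]  # hypothetical points, all != 0
kl_values = [abs(numerical_derivative(phi_of_f, x)) for x in samples]
raw_gradients = [abs(numerical_derivative(f, x)) for x in samples]

print("|(phi o f)'| :", kl_values)      # all close to 1
print("|f'|         :", raw_gradients)  # degenerates near 0
```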

In finite-dimensional spaces it has been shown in [29] that (1) is satisfied by a much larger
class of functions, namely, by those that are definable in an o-minimal structure [15], or even
more generally by functions belonging to analytic-geometric categories [21]. In the meantime the
original Lojasiewicz result was used to derive new results in the asymptotic analysis of nonlinear
heat equations [40] and damped wave equations [26]. Many results related to partial differential
equations followed, see the monograph of Huang [27] for an insight. Other fields of application of
(1) are nonconvex optimization and nonsmooth analysis. This was one of the motivations for the
nonsmooth KL-inequalities developed in [HJ E]. Due to its considerable impact on several fields of
applied mathematics: minimization and algorithms [H [HI [30], asymptotic theory of differential
inclusions [38], neural networks [24], complexity theory [37] (see [37, Definition 3] where functions
satisfying a KL-type inequality are called gradient dominated functions), partial differential
equations [40], [26], [27], we hereby tackle the problem of characterizing such inequalities in a
nonsmooth infinite-dimensional setting and provide further clarification in several application
aspects. Our framework is rather broad (infinite dimensions, nonsmooth functions); nevertheless,
to the best of our knowledge, most of the present results are also new in a smooth
finite-dimensional framework: readers who feel unfamiliar with notions of nonsmooth and variational
analysis may, at a first stage, consider that all functions involved are differentiable and replace
subdifferentials by usual derivatives and subgradient systems by smooth ones.

A first part of this work (Section 2) is devoted to the analysis of metric versions of the
KL-inequality. The underlying space $H$ is only assumed to be a complete metric space (without
any linear structure), the function $f : H \to \mathbb{R} \cup \{+\infty\}$ is lower semicontinuous and possibly
real-extended valued and the notion of a gradient is replaced by the variational notion of a strong
slope [THIS]. Indeed, introducing the multivalued mapping $F(x) = [f(x), +\infty)$ (whose graph is
the epigraph of $f$), the KL-inequality (1) appears to be equivalent to the metric regularity of
$F : H \rightrightarrows \mathbb{R}$ on an adequate set, where $\mathbb{R}$ is endowed with the metric $d_\varphi(r,s) = |\varphi(r) - \varphi(s)|$.
This fact is strongly connected to famous classical results in this area (see [H] [35], EHJ [39] for
example) and in particular to the notion of p-metric regularity introduced in [28] by A. Ioffe. The
particularity of our result is due to the fact that $F$ takes its values in a totally ordered set, which
is not the case in the general theory. Using results on global error-bounds of Aze-Corvellec [6]
and Zorn's lemma, we establish indeed that some global forms of the KL-inequality and metric
regularity are both equivalent to the "Lipschitz continuity" of the sublevel mapping

$$\begin{cases} \mathbb{R} \rightrightarrows H \\ r \mapsto [f \le r] := \{x \in H : f(x) \le r\}, \end{cases}$$

where $(0, r_0) \subset (0, +\infty)$ is endowed with $d_\varphi$, and the collection of subsets of $H$ with the "Hausdorff
distance". As it is shown in a section devoted to applications (Section 3.4), this reformulation
is particularly adapted for the analysis of proximal methods involving nonconvex criteria: these
results are in the line of [5].

In the second part of this work (Section 3), $H$ is a proper real Hilbert space and $f$ is assumed
to be a semiconvex function, i.e. $f$ is the difference of a proper lower semicontinuous convex
function and a function proportional to the canonical quadratic form. Although this assumption
is not particularly restrictive, it does not aim at full generality. Semiconvexity is used here
to provide a convenient framework in which the formulation and the study of subdifferential
evolution equations are simple and elegant ([2] H7J). Using the Frechet subdifferential (see
Definition 8), the corresponding subgradient dynamical system indeed reads

$$\begin{cases} \dot x(t) + \partial f(x(t)) \ni 0, & \text{a.e. on } (0,+\infty), \\ x(0) \in \operatorname{dom} f, \end{cases} \tag{2}$$

where $x(\cdot)$ is an absolutely continuous curve called subgradient curve. Relying on several works
[17], [34], [11], if $f$ is semiconvex, such curves exist and are unique. The asymptotic properties of the
semiflow associated to this evolution equation are strongly connected to the KL-inequality. This
can be made precise by introducing the following notion: for $T \in (0, +\infty]$, a piecewise absolutely
continuous curve $\gamma : [0, T) \to H$ (with countable pieces) is called a piecewise subgradient curve
if $\gamma$ is a solution to (2) where in addition $t \mapsto (f \circ \gamma)(t)$ is nonincreasing (see Definition 15 for
details). Consider all piecewise subgradient curves lying in a "KL-neighborhood", e.g. a slice
of level sets. Under a compactness assumption and a condition of Sard type (automatically
satisfied in finite dimensions if $f$ belongs to an o-minimal class), their lengths are uniformly
bounded if and only if $f$ satisfies the KL-inequality in its nonsmooth form (see [9]), that is, for
all $x \in [0 < f < \bar r]$,

$$\|\partial(\varphi \circ f)(x)\|_- := \inf\{\|p\| : p \in \partial(\varphi \circ f)(x)\} \ge 1,$$

where $\varphi : (0, \bar r) \to \mathbb{R}$ is a $C^1$ function, bounded from below, such that $\varphi' > 0$ (see ©). A
byproduct of this result (though not an equivalent statement, as we show in Section 4.3,
see Remark 37 (c)) is the fact that bounded subgradient curves have finite lengths and hence
converge to a generalized critical point.

Further characterizations are given involving several aspects, among which an integrability
condition in terms of the inverse function of the minimal subgradient norm associated to each
level set $[f = r]$ of $f$, as well as connections to the following talweg selection problem: Find a
piecewise absolutely continuous curve $\theta : (0, \bar r) \to H$ with finite length such that

$$\theta(r) \in \Big\{ x \in [f = r] : \|\partial(\varphi \circ f)(x)\|_- \le R \inf_{y \in [f = r]} \|\partial(\varphi \circ f)(y)\|_- \Big\}, \quad \text{with } R > 1.$$

The curve $\theta$ is called a talweg. Early connections between the KL-inequality and this old concept
can be found in [29], and even more clearly in [16]. Indeed, under mild assumptions the existence
of such a selection curve $\theta$ characterizes the KL-inequality. The proof relies strongly on the
property of the semiflow associated to $-\partial f$. Recent developments of the metric theory of
"gradient" curves ([3]) open the way to a more general approach of these characterizations, and
hopefully to new applications in the line of [3], [18].

The analysis of the convex case (that is, $f$ is a convex function) in Section 4 reveals
interesting phenomena. In this case, the KL-inequality, whenever true on a slice of level sets, will be
true on the whole space $H$ (globalization) and, in addition, the involved function $\varphi$ can be taken
to be concave (Theorem 29). This is always the case if a specific growth assumption near the
set of minimizers of $f$ is assumed. On the other hand, arbitrary convex functions do not satisfy
the KL-inequality: this is a straightforward consequence of a classical counterexample, due to
J.-B. Baillon [7], of the existence of a convex function $f$ in a Hilbert space, having a subgradient
curve which is not strongly converging to $0 \in \operatorname{argmin} f$. However, surprisingly, even smooth
finite-dimensional coercive convex functions may fail to satisfy the KL-inequality, and this even
in the case that the lengths of their gradient curves are uniformly bounded. Indeed, using the
above mentioned characterizations and results from [H], we construct a counterexample of a $C^2$
convex function whose set of minimizers is compact and has a nonempty interior (Section 4.3).

As another application we consider abstract explicit gradient schemes for convex functions 
with a Lipschitz continuous gradient. A common belief is that the analysis of gradient curves and 
their explicit discretization used in numerical optimization are somehow disconnected problems. 
We hereby show that this is not always the case, by establishing that the piecewise gradient 
iterations are uniformly bounded if and only if the piecewise subgradient curves are so. This 
aspect sheds further light on the (theoretical) stability of convex gradient-like methods and the 
interest of relating the KL-inequality to the asymptotic study of subgradient-type methods. 

Notation. (Multivalued mappings) Let $X$, $Y$ be two metric spaces and $F : X \rightrightarrows Y$ be a
multivalued mapping from $X$ to $Y$. We denote by

$$\operatorname{Graph} F := \{(x,y) \in X \times Y : y \in F(x)\} \tag{3}$$

the graph of the multivalued mapping $F$ (subset of $X \times Y$) and by

$$\operatorname{dom} F := \{x \in X : \exists y \in Y,\ (x,y) \in \operatorname{Graph} F\} \tag{4}$$

its domain (subset of $X$).

(Single-valued functions) Given a function $f : X \to \mathbb{R} \cup \{+\infty\}$ we define its epigraph by

$$\operatorname{epi} f := \{(x,\beta) \in X \times \mathbb{R} : f(x) \le \beta\}. \tag{5}$$

We say that the function $f$ is proper (respectively, lower semicontinuous) if the above set is
nonempty (respectively, closed). Let us recall that the domain of the function $f$ is defined by

$$\operatorname{dom} f := \{x \in X : f(x) < +\infty\}.$$

(Level sets) Given $r_1 \le r_2$ in $[-\infty, +\infty]$ we set

$$[r_1 \le f \le r_2] := \{x \in X : r_1 \le f(x) \le r_2\}.$$

When $r_1 = r_2$ (respectively $r_1 = -\infty$), the above set will be simply denoted by $[f = r_1]$
(respectively $[f \le r_2]$).

(Strong slope) Let us recall from [18] (see also [28], [6]) the notion of strong slope defined for
every $x \in \operatorname{dom} f$ as follows:

$$|\nabla f|(x) = \limsup_{y \to x} \frac{(f(x) - f(y))^+}{d(x,y)}, \tag{6}$$

where for every $a \in \mathbb{R}$ we set $a^+ = \max\{a, 0\}$.
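To get a feeling for definition (6), the following sketch estimates the strong slope of a function on the real line. It is a crude illustration only: the helper `strong_slope_1d` and the fixed radius are our own assumptions, and the limsup is replaced by a supremum over the two points at a single small distance from $x$, which is exact for the piecewise linear examples used here. The strong slope only sees descent, so it vanishes at a minimizer of $|x|$ while it equals $1$ at the maximizer of $-|x|$.

```python
def strong_slope_1d(f, x, radius=1e-6):
    """Crude estimate of the strong slope |grad f|(x) on the real line.

    Evaluates (f(x) - f(y))^+ / |x - y| at y = x - radius and y = x + radius
    and returns the larger value (a stand-in for the limsup as y -> x).
    """
    best = 0.0
    for y in (x - radius, x + radius):
        descent = max(f(x) - f(y), 0.0)      # positive part (f(x) - f(y))^+
        best = max(best, descent / abs(x - y))
    return best

slope_abs = strong_slope_1d(abs, 0.0)                       # minimizer of |x|
slope_neg_abs = strong_slope_1d(lambda x: -abs(x), 0.0)     # maximizer of -|x|
slope_square = strong_slope_1d(lambda x: x * x, 1.0)        # smooth case

print(slope_abs, slope_neg_abs, slope_square)
```

For the smooth function $x \mapsto x^2$ the estimate agrees, up to discretization error, with $|f'(1)| = 2$.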

If $[X, \|\cdot\|]$ is a Banach space with (topological) dual space $[X^*, \|\cdot\|_*]$ and $f$ is a $C^1$
finite-valued function then

$$|\nabla f|(x) = \|\nabla f(x)\|_*,$$
for all $x$ in $X$, where $\nabla f(\cdot)$ is the differential map of $f$.

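(Hausdorff distance) We define the distance of a point $x \in X$ to a subset $S$ of $X$ by

$$\operatorname{dist}(x, S) := \inf_{y \in S} d(x,y),$$

where $d$ denotes the distance on $X$. The Hausdorff distance $\operatorname{Dist}(S_1, S_2)$ of two subsets $S_1$ and
$S_2$ of $X$ is given by

$$\operatorname{Dist}(S_1, S_2) := \max\Big\{ \sup_{x \in S_1} \operatorname{dist}(x, S_2),\ \sup_{x \in S_2} \operatorname{dist}(x, S_1) \Big\}. \tag{7}$$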

Let us denote by $\mathcal{P}(X)$ the collection of all subsets of $X$. In general $\operatorname{Dist}(\cdot,\cdot)$ can take infinite
values and does not define a distance on $\mathcal{P}(X)$. However if $\mathcal{K}(X)$ denotes the collection of
nonempty compact subsets of $X$, then $\operatorname{Dist}(\cdot,\cdot)$ defines a proper notion of distance on $\mathcal{K}(X)$. In
the sequel we deal with multivalued mappings $F : X \rightrightarrows Y$ enjoying the following property

$$\operatorname{Dist}(F(x), F(y)) \le k\, d(x,y),$$

where $k$ is a positive constant. For simplicity such functions are called Lipschitz continuous,
although $[\mathcal{P}(Y), \operatorname{Dist}]$ is not a metric space in general.
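For finite nonempty subsets of the real line, formula (7) can be evaluated directly. The sketch below is a minimal illustration; the function names `dist_point_set` and `hausdorff` are ours, and the sets are assumed finite and nonempty so that every infimum and supremum is attained.

```python
def dist_point_set(x, S):
    """dist(x, S) = min over y in S of |x - y| (S finite and nonempty)."""
    return min(abs(x - y) for y in S)

def hausdorff(S1, S2):
    """Dist(S1, S2) as in (7), for finite nonempty subsets of the real line."""
    excess_1 = max(dist_point_set(x, S2) for x in S1)  # sup over S1 of dist(., S2)
    excess_2 = max(dist_point_set(x, S1) for x in S2)  # sup over S2 of dist(., S1)
    return max(excess_1, excess_2)

A = [0, 1]
B = [0, 1, 3]
print(hausdorff(A, B))  # the point 3 of B is at distance 2 from A
```

The two one-sided excesses are genuinely different here (the first is $0$, the second is $2$), which is why the maximum of both is taken.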

(Desingularization functions) Given $\bar r \in (0, +\infty]$, we set

$$\mathcal{K}(0, \bar r) := \{\phi \in C([0, \bar r)) \cap C^1(0, \bar r) : \phi(0) = 0, \text{ and } \phi'(r) > 0,\ \forall r \in (0, \bar r)\}, \tag{8}$$

where $C([0, \bar r))$ (respectively, $C^1(0, \bar r)$) denotes the set of continuous functions on $[0, \bar r)$
(respectively, $C^1$ functions on $(0, \bar r)$).

Finally throughout this work, $B(x,r)$ will stand for the usual open ball of center $x$ and
radius $r > 0$ and $\bar B(x, r)$ will denote its closure. If $H$ is a Hilbert space, its inner product will
be denoted by $\langle\cdot,\cdot\rangle$ and the corresponding norm by $\|\cdot\|$.

2 KL-inequality is a metric regularity condition

Let $X$, $Y$ be two complete metric spaces, $F : X \rightrightarrows Y$ a multivalued mapping and $(\bar x, \bar y) \in
\operatorname{Graph} F$. Let us recall from [28, Definition 1 (loc)] the following definition.
Definition 1 (metric regularity of multifunctions). Let $k \in [0, +\infty)$. (i) The multivalued
mapping $F$ is called $k$-metrically regular at $(\bar x, \bar y) \in \operatorname{Graph} F$, if there exist $\varepsilon, \delta > 0$ such that
for all $(x,y) \in B(\bar x, \varepsilon) \times B(\bar y, \delta)$ we have

$$\operatorname{dist}(x, F^{-1}(y)) \le k \operatorname{dist}(y, F(x)). \tag{9}$$

(ii) Let $V$ be a nonempty subset of $X \times Y$. The multivalued mapping $F$ is called $k$-metrically
regular on $V$, if $F$ is $k$-metrically regular at $(\bar x, \bar y)$ for every $(\bar x, \bar y) \in \operatorname{Graph} F \cap V$.

2.1 Metric regularity and global error bounds 

The following theorem is an essential result: it will show that the Kurdyka-Lojasiewicz inequality
and metric regularity are equivalent concepts (see Corollary 4 and Remark 5). The equivalence
[(ii)$\Leftrightarrow$(iii)] is due to Aze-Corvellec (see [6, Theorem 2.1]).

Theorem 2. Let $X$ be a complete metric space, $f : X \to \mathbb{R} \cup \{+\infty\}$ a proper lower
semicontinuous function and $r_0 > 0$. The following assertions are equivalent:

(i) The multivalued mapping

$$F : \begin{cases} X \rightrightarrows \mathbb{R} \\ x \mapsto [f(x), +\infty) \end{cases}$$

is $k$-metrically regular on $[0 < f < r_0] \times (0, r_0)$;

(ii) For all $r \in (0, r_0)$ and $x \in [0 < f < r_0]$

$$\operatorname{dist}(x, [f \le r]) \le k\,(f(x) - r)^+; \tag{10}$$

(iii) For all $x \in [0 < f < r_0]$

$$|\nabla f|(x) \ge \frac{1}{k}.$$
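The equivalence between the error bound (10) and the slope condition (iii) can be checked by hand on a one-dimensional example. The sketch below assumes, for illustration only, $f(x) = c|x|$ on the real line with $c = 2$ and $r_0 = 1$; the strong slope is then $c$ at every $x \neq 0$, so (iii) holds with $k = 1/c$, and the sublevel set $[f \le r]$ is the interval $[-r/c, r/c]$. The grid and the helper names are our own choices.

```python
C = 2.0          # assumed slope of the model function f(x) = C * |x|
K = 1.0 / C      # constant k predicted by assertion (iii)
R0 = 1.0         # assumed upper level r_0

def f(x):
    return C * abs(x)

def dist_to_sublevel(x, r):
    """Distance from x to [f <= r] = [-r/C, r/C]."""
    return max(abs(x) - r / C, 0.0)

# Sample points of [0 < f < r_0] and levels r in (0, r_0).
grid = [i / 100.0 for i in range(-49, 50) if i != 0]
levels = [j / 10.0 for j in range(1, 10)]

violations = [
    (x, r)
    for x in grid
    for r in levels
    if dist_to_sublevel(x, r) > K * max(f(x) - r, 0.0) + 1e-12
]
print("violations of the error bound (10):", violations)
```

In this model case (10) even holds with equality, which shows that the constant $k = 1/c$ cannot be improved.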

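Proof. The equivalence of (ii) and (iii) follows from [6, Theorem 2.1] and is based on the Ekeland
variational principle. Definition 1 (metric regularity of multifunctions) yields the following
restatement (i)$_1$ of (i):

For every $(\bar x, \bar r) \in \operatorname{Graph} F$ with $\bar x \in [0 < f < r_0]$ and $\bar r \in (0, r_0)$, there exist $\varepsilon > 0$ and
$\delta > 0$ such that

$$(x, r) \in (B(\bar x, \varepsilon) \cap [0 < f < r_0]) \times [(\bar r - \delta, \bar r + \delta) \cap (0, r_0)] \implies \operatorname{dist}(x, [f \le r]) \le k\,(f(x) - r)^+. \tag{11}$$

Clearly (i) $\Rightarrow$ (i)$_1$. Now, in order to prove (i)$_1$ $\Rightarrow$ (i), consider $(\bar x, \bar r) \in \operatorname{Graph} F \cap [0 <
f < r_0] \times (0, r_0)$. Take $\varepsilon$ and $\delta$ positive given by (11) such that $0 < \bar r - \delta < \bar r + 2\delta < r_0$,
$\varepsilon < k(r_0 - \bar r - 2\delta)$ and $f$ is positive in $B(\bar x, \varepsilon)$ ($f$ is lower semicontinuous so $[f > 0]$ is open).
For any $(x, r) \in B(\bar x, \varepsilon) \times (\bar r - \delta, \bar r + \delta)$, we have $r \in (0, r_0)$ and $f(x) > 0$. Thus if $f(x) < r_0$, by
(i)$_1$ we have

$$\operatorname{dist}(x, [f \le r]) \le k\,(f(x) - r)^+ = k \operatorname{dist}(r, F(x)).$$

If $f(x) \ge r_0$, then

$$\begin{aligned}
\operatorname{dist}(x, [f \le r]) &\le d(x, \bar x) + \operatorname{dist}(\bar x, [f \le r]) \le \varepsilon + k\,(f(\bar x) - r)^+ \\
&\le \varepsilon + k\delta \\
&< k(r_0 - \bar r - \delta) \\
&< k(r_0 - r) \\
&\le k\,(f(x) - r)^+ = k \operatorname{dist}(r, F(x)).
\end{aligned}$$

Thus (i)$_1$ $\Rightarrow$ (i).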

It is now straightforward to see that (ii) $\Rightarrow$ (i), thus it remains to prove that (i)$_1$ $\Rightarrow$ (ii).
To this end, fix any $k' > k$, $r_1 \in (0, r_0)$ and $x_1 \in [f = r_1]$. We shall prove that

$$\operatorname{dist}(x_1, [f \le s]) \le k'(r_1 - s),$$

for all $s \in (0, r_1]$.

Claim 1. Let $r \in (0, r_0)$ and $x \in [f = r]$. Then there exist $r^- < r$ and $x^- \in [f = r^-]$ such that

$$d(x, x^-) \le k'(r - r^-) \tag{12}$$

with

$$\operatorname{dist}(x, [f \le s]) \le k'(r - s), \quad \text{for all } s \in [r^-, r].$$

Proof of Claim 1. Apply (i)$_1$ at $(x, r) \in \operatorname{Graph} F$ to obtain the existence of $\rho \in (0, r)$ such that
$\operatorname{dist}(x, [f \le s]) \le k(r - s)$ for all $s \in [\rho, r]$. Since $k' > k$ there exists $x^- \in [f \le \rho]$ satisfying

$$d(x, x^-) \le \frac{k'}{k} \operatorname{dist}(x, [f \le \rho]),$$

which in view of (11) yields

$$d(x, x^-) \le k'(r - \rho).$$
To conclude, set $r^- = f(x^-) \le \rho$ and observe that for any $s \in [r^-, \rho]$ we have

$$\operatorname{dist}(x, [f \le s]) \le d(x, x^-) \le k'(r - \rho) \le k'(r - s) = k'(f(x) - s).$$

This completes the proof of the claim.

Let $\mathcal{A}$ be the set of all families $\{(x_i, r_i)\}_{i \in I} \subset [f \le r_1] \times \mathbb{R}$ containing $(x_1, r_1)$ such that

- (P$_1$) $f(x_i) = r_i$ for all $i \in I$ and $r_i \neq r_j$ for $i \neq j$;

- (P$_2$) If $i, j \in I$ and $r_i < r_j$ then $d(x_j, x_i) \le k'(r_j - r_i)$;

- (P$_3$) For $r^* = \inf\{r_i : i \in I\}$ and for $s \in (r^*, r_1]$ we have:

$$\operatorname{dist}(x_1, [f \le s]) \le k'(r_1 - s).$$

The set $\mathcal{A}$ is nonempty (it contains the one-element family $\{(x_1, r_1)\}$) and can be ordered by
the inclusion relation (that is, $J_1 \preceq J_2$ if, and only if, $J_1 \subset J_2$). Under this relation $\mathcal{A}$ becomes
an inductive partially ordered set: every totally ordered chain in $\mathcal{A}$ has an upper bound in $\mathcal{A}$ (its union).
Thus, by Zorn's lemma, there exists a maximal element $\mathcal{M} = \{(x_i, r_i)\}_{i \in I}$ in $\mathcal{A}$.
Claim 2. Any maximal element $\mathcal{M} = \{(x_i, r_i)\}_{i \in I}$ of $\mathcal{A}$ satisfies

$$r^* = \inf_{i \in I} r_i \le 0. \tag{13}$$

Proof of Claim 2. Let us assume, towards a contradiction, that (13) is not true, i.e. $r^* > 0$.
Let us first assume that there exists $j \in I$ such that $r^* = r_j$. Define $r^- := r_j^- < r_j$ and
$x^- = x_j^- \in [f = r^-]$ as specified in Claim 1 and consider the family $\mathcal{M}_1 = \mathcal{M} \cup \{(x^-, r^-)\}$.
Then $\mathcal{M}_1$ clearly complies with (P$_1$). To see that $\mathcal{M}_1$ satisfies (P$_2$), simply observe that for
each $i \in I$,

$$d(x^-, x_i) \le d(x^-, x_j) + d(x_j, x_i) \le k'(r_i - r^-).$$
Let $s \in [r^-, r_j]$. By using the properties of the couple $(x^-, r^-)$, one obtains

$$\operatorname{dist}(x_1, [f \le s]) \le d(x_1, x_j) + \operatorname{dist}(x_j, [f \le s]) \le k'(r_1 - r_j) + k'(r_j - s) \le k'(r_1 - s).$$

This means that $\mathcal{M}_1 \in \mathcal{A}$, which contradicts the maximality of $\mathcal{M}$.

Thus it remains to treat the case when the infimum $r^*$ is not attained. Let us take any decreasing
sequence $\{r_{i_n}\}_{n \ge 1}$, $i_n \in I$, satisfying $r_{i_1} = r_1$ and $r_{i_n} \searrow r^*$. For simplicity the sequences $\{r_{i_n}\}_n$
and $\{x_{i_n}\}_n$ will be denoted, respectively, by $\{r_n\}_n$ and $\{x_n\}_n$. Applying (P$_2$) we obtain

$$d(x_n, x_{n+m}) \le k'(r_n - r_{n+m}). \tag{14}$$

It follows that $\{x_n\}_{n \ge 1}$ is a Cauchy sequence, thus it converges to some $x_*$. Taking the limit as
$m \to +\infty$ we deduce from (14) that $d(x_n, x_*) \le k'(r_n - r^*)$, for all $n \in \mathbb{N}^*$. For any $i \in I$, there
exists $n$ such that $r_n < r_i$ and therefore

$$d(x_*, x_i) \le d(x_*, x_n) + d(x_n, x_i) \le k'(r_i - r^*) \le k'(r_i - f(x_*)), \tag{15}$$

where the last inequality follows from the lower semicontinuity of $f$. Set $f(x_*) = \rho^* \le r^*$ and
$\mathcal{M}_1 = \mathcal{M} \cup \{(x_*, \rho^*)\}$. Since the infimum is not attained in $\inf\{r_i : i \in I\}$ the family $\mathcal{M}_1$ satisfies
(P$_1$). Further, by using (15), we see that $\mathcal{M}_1$ complies also with (P$_2$). Take $s \in [\rho^*, r^*]$. Since
$x_* \in [f \le s]$, we have

$$\operatorname{dist}(x_1, [f \le s]) \le d(x_1, x_*) \le k'(r_1 - r^*) \le k'(r_1 - s).$$

Hence $\mathcal{M}_1$ belongs to $\mathcal{A}$, which contradicts the maximality of $\mathcal{M}$.

The desired implication follows easily by taking the limit as k' goes to k. This completes the 
proof. □ 

Remark 3 (Sublevel mapping and Lipschitz continuity). It is straightforward to see that state- 
ment (ii) above is equivalent to the "Lipschitz continuity" (see 0) of the sublevel set application 

$$\begin{cases} (0, r_0) \rightrightarrows X \\ r \mapsto [f \le r] \end{cases}$$

for the Hausdorff "metric" given in (7). Note that $F^{-1}$ is exactly the sublevel mapping given
above, and thus in this context the Lipschitz continuity of $F^{-1}$ is equivalent to the Aubin
property of $F^{-1}$, see [20], [28].
2.2 Metric regularity and KL inequality

As an immediate consequence of Theorem 2 and Remark 3 we have the following result.

Corollary 4 (KL-inequality and sublevel set mapping). Let $f : X \to \mathbb{R} \cup \{+\infty\}$ be a lower
semicontinuous function defined on a complete metric space $X$ and let $\varphi \in \mathcal{K}(0, r_0)$ (see (8)).
The following assertions are equivalent:

(i) the multivalued mapping

$$\begin{cases} X \rightrightarrows \mathbb{R} \\ x \mapsto [(\varphi \circ f)(x), +\infty) \end{cases}$$

is $k$-metrically regular on $[0 < f < r_0] \times (0, \varphi(r_0))$;

(ii) for all $r_1, r_2 \in (0, r_0)$

$$\operatorname{Dist}([f \le r_1], [f \le r_2]) \le k\,|\varphi(r_1) - \varphi(r_2)|;$$
(iii) for all $x \in [0 < f < r_0]$

$$|\nabla(\varphi \circ f)|(x) \ge \frac{1}{k}.$$
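A one-dimensional example makes assertion (ii) concrete. The sketch below assumes, purely for illustration, $f(x) = x^2$ on the real line and $\varphi(t) = \sqrt{t}$, so that $\varphi \circ f = |x|$ has strong slope $1$ away from $0$ and $k = 1$; the sublevel set $[f \le r]$ is the interval $[-\sqrt r, \sqrt r]$. The helper names and the chosen levels are ours. The Hausdorff distance between two compact intervals is the larger of the distances between corresponding endpoints.

```python
import math

def sublevel(r):
    """Sublevel set [f <= r] of f(x) = x^2, returned as an interval (a, b)."""
    s = math.sqrt(r)
    return (-s, s)

def hausdorff_intervals(I, J):
    """Hausdorff distance between two compact intervals of the real line."""
    return max(abs(I[0] - J[0]), abs(I[1] - J[1]))

phi = math.sqrt  # assumed desingularizing function phi(t) = t^(1/2)

pairs = [(1e-6, 4e-6), (0.01, 0.04), (0.25, 1.0)]  # hypothetical levels (r1, r2)
gaps = [hausdorff_intervals(sublevel(a), sublevel(b)) for a, b in pairs]
phi_gaps = [abs(phi(a) - phi(b)) for a, b in pairs]
plain_ratios = [g / abs(a - b) for g, (a, b) in zip(gaps, pairs)]

print("Dist / |phi(r1) - phi(r2)| :", [g / p for g, p in zip(gaps, phi_gaps)])
print("Dist / |r1 - r2|           :", plain_ratios)
```

With respect to $|\varphi(r_1) - \varphi(r_2)|$ the sublevel mapping is Lipschitz with constant $1$, whereas the ratio against the usual distance $|r_1 - r_2|$ blows up as the levels approach the critical value $0$.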
It might be useful to observe the following: 

Remark 5 (Change of metric). Let $\varphi \in \mathcal{K}(0, r_0)$ and assume that it can be extended
continuously to an increasing function still denoted $\varphi : \mathbb{R}_+ \to \mathbb{R}_+$. Set $d_\varphi(r,s) = |\varphi(r) - \varphi(s)|$ for
any $r, s \in \mathbb{R}_+$. Endowing $\mathbb{R}_+$ with this new
metric, assertions (i), (ii) and (iii) can be reformulated very simply:

(i') The multivalued mapping

$$\begin{cases} X \rightrightarrows \mathbb{R}_+ \\ x \mapsto [f(x), +\infty) \end{cases}$$

is $k$-metrically regular on $[0 < f < r_0] \times (0, r_0)$.
(ii') The sublevel mapping

$$\mathbb{R}_+ \ni r \mapsto [f \le r]$$

is $k$-Lipschitz continuous on $(0, r_0)$.
(iii') For all $x \in [0 < f < r_0]$

$$|\nabla_\varphi f|(x) \ge \frac{1}{k},$$

where $|\nabla_\varphi f|$ denotes the strong slope of the restricted function $f : [0 < f] \to [\mathbb{R}_+, d_\varphi]$.

Given a lower semicontinuous function $f : X \to \mathbb{R} \cup \{+\infty\}$ we say that $f$ is strongly
slope-regular, if for each point $x$ in its domain $\operatorname{dom} f$ one has

$$|\nabla f|(x) = |\nabla(-f)|(x). \tag{16}$$

Note that all $C^1$ functions are strongly slope-regular according to the above definition.
Proposition 6 (Level mapping and Lipschitz continuity). Assume $f : X \to \mathbb{R}$ is continuous
and strongly slope-regular. Then any of the assertions (i)-(iii) of Theorem 2 is equivalent to the
fact that the level set application

$$\begin{cases} \mathbb{R} \rightrightarrows X \\ r \mapsto [f = r] \end{cases}$$

is Lipschitz continuous on $(0, r_0)$ with respect to the Hausdorff metric.

Proof. The result follows by applying Theorem 2 twice. (Details are left to the reader.) □
Let us finally state the following important corollary. 

Corollary 7 (KL-inequality and level set mapping). Let $f : X \to \mathbb{R}$ be a continuous function
which is strongly slope-regular on $[0 < f < r_0]$ and let $\varphi \in \mathcal{K}(0, r_0)$ (recall (8)). Then the
following assertions are equivalent:

(i) $\varphi \circ f$ is $k$-metrically regular on $[0 < f < r_0] \times (0, \varphi(r_0))$;

(ii) for all $r_1, r_2 \in (0, r_0)$

$$\operatorname{Dist}([f = r_1], [f = r_2]) \le k\,|\varphi(r_1) - \varphi(r_2)|;$$
(iii) for all $x \in [0 < f < r_0]$

$$|\nabla(\varphi \circ f)|(x) \ge \frac{1}{k}.$$

Proof. It follows easily by combining Theorem 2 with Proposition 6. □

3 KL-inequality in Hilbert spaces

From now on, we shall work on a real Hilbert space $[H, \langle\cdot,\cdot\rangle]$. Given a vector $x$ in $H$, the norm
of $x$ is defined by $\|x\| = \sqrt{\langle x, x\rangle}$ while for any subset $C$ of $H$, we set

$$\|C\|_- = \operatorname{dist}(0, C) = \inf\{\|x\| : x \in C\} \in \mathbb{R} \cup \{+\infty\}. \tag{17}$$

Note that $C = \emptyset$ implies $\|C\|_- = +\infty$.

3.1 Elements of nonsmooth analysis 

Let us first recall the notion of Frechet subdifferential (see [13], [36]).

Definition 8 (Frechet subdifferential). Let $f : H \to \mathbb{R} \cup \{+\infty\}$ be a real-extended-valued
function. We say that $p \in H$ is a (Frechet) subgradient of $f$ at $x \in \operatorname{dom} f$ if

$$\liminf_{y \to x,\ y \neq x} \frac{f(y) - f(x) - \langle p, y - x\rangle}{\|y - x\|} \ge 0.$$

We denote by $\partial f(x)$ the set of Frechet subgradients of $f$ at $x$ and set $\partial f(x) = \emptyset$ for $x \notin \operatorname{dom} f$.
Let us now define the notion of critical point in variational analysis.

Definition 9 (critical points/values). (i) A point $x_0 \in H$ is called critical for the function $f$, if
$0 \in \partial f(x_0)$.

(ii) The value $r \in f(H)$ is called a critical value, if $[f = r]$ contains at least one critical point.
In this section we shall mainly deal with the class of semiconvex functions. Let us give the
corresponding definition. (The reader should be aware that the terminology is not yet completely
fixed in this area, so that the notion of semiconvex function may vary slightly from one author
to another.)

Definition 10 (semiconvexity). A proper lower semicontinuous function $f$ is called semiconvex
(or convex up to a square) if for some $\alpha > 0$ the function

$$x \mapsto f(x) + \frac{\alpha}{2}\|x\|^2$$

is convex.
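A quick numerical illustration of semiconvexity, under assumptions of our own: we take the quadratic term to be normalized as $\frac{\alpha}{2}\|x\|^2$, the model function $f(x) = |x| - x^2$ on the real line, and a midpoint test on a finite grid as a necessary (not sufficient) check of convexity. None of the names below come from the paper.

```python
def is_midpoint_convex(g, points, tol=1e-12):
    """Check g((a+b)/2) <= (g(a)+g(b))/2 for all pairs of grid points.

    This is only a necessary condition for convexity, tested on a finite grid.
    """
    return all(
        g(0.5 * (a + b)) <= 0.5 * (g(a) + g(b)) + tol
        for a in points
        for b in points
    )

def f(x):
    return abs(x) - x * x   # nonconvex, but convex up to a square

def regularized(alpha):
    """Return x -> f(x) + (alpha / 2) * x^2."""
    return lambda x: f(x) + 0.5 * alpha * x * x

grid = [-2.0 + 0.25 * i for i in range(17)]   # hypothetical grid on [-2, 2]

print(is_midpoint_convex(f, grid))                  # False: f is not convex
print(is_midpoint_convex(regularized(1.0), grid))   # False: alpha = 1 is too small
print(is_midpoint_convex(regularized(2.0), grid))   # True: f + x^2 = |x|
```

Here $\alpha = 2$ is the smallest constant that works, since $f(x) + \frac{\alpha}{2}x^2 = |x| + (\frac{\alpha}{2} - 1)x^2$.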
Remark 11.

(i) For each $x \in H$, $\partial f(x)$ is a (possibly empty) closed convex subset of $H$ and $\partial f(x)$ is nonempty
for $x \in \operatorname{int} \operatorname{dom} f$.

(ii) It is straightforward from the above definition that the multivalued operator $x \mapsto \partial f(x) + \alpha x$
is (maximal) monotone (see [42, Definition 12.5] for the definition).

(iii) For general properties of semiconvex functions, see [2]. Let us mention that Definition 10
is equivalent to the fact that

$$f(y) - f(x) \ge \langle p, y - x\rangle - \alpha\|x - y\|^2, \tag{18}$$
for all $x, y \in H$ and all $p \in \partial f(x)$ (where $\alpha > 0$).

(iv) According to Definition 10, semiconvex functions are contained in several important classes
of (nonsmooth) functions, as for instance $\varphi$-convex functions ([32]), weakly convex functions
([!]) and primal-lower-nice functions ([34]). Although an important part of the forthcoming
results is extendable to these more general classes, we shall hereby sacrifice extreme generality
for the sake of simplicity of presentation.

Given a real-extended-valued function $f$ on $H$, we define the remoteness (i.e., distance to
zero) of its subdifferential $\partial f$ at $x \in H$ as follows:

$$\|\partial f(x)\|_- = \inf_{p \in \partial f(x)} \|p\| = \operatorname{dist}(0, \partial f(x)). \qquad \text{(remoteness)}$$

Remark 12. (minimal norm)

(i) If $\partial f(x) \neq \emptyset$, the infimum in the above definition is achieved since $\partial f(x)$ is a nonempty
closed convex set. If we define $\partial^0 f(x)$ as the projection of $0$ on the closed convex set $\partial f(x)$ we
of course have

$$\|\partial f(x)\|_- = \|\partial^0 f(x)\|. \tag{19}$$
Some properties of $H \ni x \mapsto \|\partial f(x)\|_-$ are given in Section 5 (Annex).

(ii) If $f$ is a semiconvex function, then $\|\partial f(x)\|_-$ coincides with the notion of strong slope
$|\nabla f|(x)$ introduced in (6), see Lemma 42 (Annex).
3.2 Subgradient curves: basic properties

Let $f : H \to \mathbb{R} \cup \{+\infty\}$ be a proper lower semicontinuous semiconvex function. The purpose
of this subsection is to recall the main properties of the trajectories (subgradient curves) of the
corresponding differential inclusion:

$$\begin{cases} \dot\chi_x(t) \in -\partial f(\chi_x(t)) & \text{a.e. on } (0,+\infty), \\ \chi_x(0) = x \in \operatorname{dom} f. \end{cases}$$

The following statement aggregates useful results concerning existence and uniqueness of
solutions. These results are essentially known even for a more general class of functions (see
[34, Theorem 2.1, Proposition 2.14, Theorem 3.3] for instance for the class of primal-lower-nice
functions). It should also be noticed that the integration of measurable curves of the form
$\mathbb{R} \ni t \mapsto \gamma(t) \in H$ relies on Bochner integration/measurability theory (basic properties can be
found in [11]).

Theorem 13 (subgradient curves). For every $x \in \operatorname{dom} f$ there exists a unique absolutely
continuous curve (called trajectory or subgradient curve) $\chi_x : [0, +\infty) \to H$ that satisfies

$$\begin{cases} \dot\chi_x(t) \in -\partial f(\chi_x(t)) & \text{a.e. on } (0,+\infty), \\ \chi_x(0) = x \in \operatorname{dom} f. \end{cases} \tag{20}$$

Moreover the trajectory satisfies:

(i) $\chi_x(t) \in \operatorname{dom} \partial f$ for all $t \in (0, +\infty)$.

(ii) For all $t > 0$, the right derivative $\dot\chi_x(t^+)$ of $\chi_x$ is well defined and equal to

$$\dot\chi_x(t^+) = -\partial^0 f(\chi_x(t)).$$

In particular $\dot\chi_x(t) = -\partial^0 f(\chi_x(t))$, for almost all $t$.

(iii) The mapping $t \mapsto \|\partial f(\chi_x(t))\|_-$ is right-continuous at each $t \in (0, +\infty)$.

(iv) The function $t \mapsto f(\chi_x(t))$ is nonincreasing and continuous on $[0, +\infty)$. Moreover, for
all $t, \tau \in [0, +\infty)$ with $t \le \tau$, we have

$$f(\chi_x(t)) - f(\chi_x(\tau)) \ge \int_t^\tau \|\dot\chi_x(u)\|^2\, du,$$

and equality holds if $t > 0$.
(v) The function $t \mapsto f(\chi_x(t))$ is Lipschitz continuous on $[\eta, +\infty)$ for any $\eta > 0$. Moreover

$$\frac{d}{dt} f(\chi_x(t)) = -\|\dot\chi_x(t)\|^2 \quad \text{a.e. on } (\eta, +\infty).$$
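The energy identity in (iv) can be verified on the simplest possible example. The following sketch assumes, for illustration, $f(x) = \frac{1}{2}x^2$ on the real line, whose subgradient curve is the explicit solution $\chi_x(t) = x e^{-t}$ of $\dot\chi = -\chi$; the dissipation integral is approximated with a composite trapezoid rule. The function names and the numerical parameters are our own.

```python
import math

def trajectory(x0, t):
    """Exact subgradient curve of f(x) = x^2 / 2 starting at x0: chi(t) = x0 * exp(-t)."""
    return x0 * math.exp(-t)

def speed_squared(x0, u):
    """|chi'(u)|^2, using chi'(u) = -chi(u)."""
    return trajectory(x0, u) ** 2

def energy_drop(x0, t, tau):
    """f(chi(t)) - f(chi(tau)) for f(x) = x^2 / 2."""
    return 0.5 * trajectory(x0, t) ** 2 - 0.5 * trajectory(x0, tau) ** 2

def dissipation(x0, t, tau, n=10000):
    """Trapezoid approximation of the integral of |chi'(u)|^2 over [t, tau]."""
    h = (tau - t) / n
    total = 0.5 * (speed_squared(x0, t) + speed_squared(x0, tau))
    total += sum(speed_squared(x0, t + i * h) for i in range(1, n))
    return total * h

drop = energy_drop(3.0, 0.5, 2.0)
integral = dissipation(3.0, 0.5, 2.0)
print(drop, integral)  # the two values agree up to quadrature error
```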
Proof. The only assertion that does not appear explicitly in [34] is the continuity of the function
$f \circ \chi_x$ at $t = 0$ when $x \in \operatorname{dom} f \setminus \operatorname{dom} \partial f$, but this is an easy consequence of the fact that $f$ is
lower semicontinuous, $\chi_x$ is (absolutely) continuous and $f \circ \chi_x$ is nonincreasing. For the rest of the
assertions we refer to [34]. □



The following result asserts that the semiflow mapping associated with the differential
inclusion (20) is continuous. This type of result can be established by standard techniques and
therefore is essentially known (see [11], [34] for example). We give here an outline of proof (in
case that $f$ is semiconvex) for the reader's convenience.

Theorem 14 (continuity of the semiflow). For any semiconvex function $f$ the semiflow mapping

$$\begin{cases} \mathbb{R}_+ \times \operatorname{dom} f \to H \\ (t, x) \mapsto \chi_x(t) \end{cases}$$

is (norm) continuous on each subset of the form $[0, T] \times (\bar B(0, R) \cap [f \le r])$ where $T, R > 0$ and
$r \in \mathbb{R}$.

Proof. Let us fix $x, y \in \operatorname{dom} f$ and $T > 0$. Then for almost all $t \in [0, T]$, there exist $p(\chi_x(t)) \in
\partial f(\chi_x(t))$ and $q(\chi_y(t)) \in \partial f(\chi_y(t))$ such that

$$\frac{d}{dt}\|\chi_x(t) - \chi_y(t)\|^2 = 2\langle \dot\chi_x(t) - \dot\chi_y(t), \chi_x(t) - \chi_y(t)\rangle = -2\langle \chi_x(t) - \chi_y(t), p(\chi_x(t)) - q(\chi_y(t))\rangle.$$
It follows by (18) that

$$\frac{d}{dt}\|\chi_x(t) - \chi_y(t)\|^2 \le 2\alpha\|\chi_x(t) - \chi_y(t)\|^2,$$
which implies (using Gronwall's lemma) that for all $0 \le t \le T$ we have

$$\|\chi_x(t) - \chi_y(t)\|^2 \le \exp(2\alpha T)\|x - y\|^2. \tag{21}$$
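For any $0 \le t \le s \le T$, using the Cauchy-Schwarz inequality and Theorem 13 we deduce that

$$\|\chi_x(s) - \chi_x(t)\| \le \int_t^s \|\dot\chi_x(\tau)\|\, d\tau \le \sqrt{s - t}\, \Big(\int_t^s \|\dot\chi_x(\tau)\|^2\, d\tau\Big)^{1/2} \le \sqrt{s - t}\, \sqrt{f(\chi_x(t)) - f(\chi_x(s))}. \tag{22}$$

The result follows by combining (21) and (22). □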

Let us introduce the notions of a piecewise absolutely continuous curve and of a piecewise
subgradient curve. This latter notion, due to its robustness, will play a central role in our study.

Definition 15. Let $a, b \in [-\infty, +\infty]$ with $a < b$.

(Piecewise absolutely continuous curve) A curve $\gamma : (a, b) \to H$ is said to be piecewise absolutely
continuous if there exists a countable partition of $(a, b)$ into intervals $I_k$ such that the restriction
of $\gamma$ to each $I_k$ is absolutely continuous.

(Length of a curve) Let $\gamma : (a, b) \to H$ be a piecewise absolutely continuous curve. The length
of $\gamma$ is defined by

$$\operatorname{length}[\gamma] := \int_a^b \|\dot\gamma(t)\|\, dt.$$

(Piecewise subgradient curve) Let $T \in (0, +\infty]$. A curve $\gamma : [0, T) \to H$ is called a piecewise
subgradient curve for (20) if there exists a countable partition of $[0, T]$ into (nontrivial) intervals
$I_k$ such that:

- the restriction of $\gamma$ to each interval $I_k$ is a subgradient curve;

- for each disjoint pair of intervals $I_k, I_l$, the intervals $f(\gamma(I_k))$ and $f(\gamma(I_l))$ have at most
one point in common.

Note that piecewise subgradient curves are piecewise absolutely continuous. Observe also
that subgradient curves satisfy the above definition in a trivial way.
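The notion of length in Definition 15 can be approximated by the length of an inscribed polygonal line. The sketch below is an illustration under our own assumptions: the planar spiral $\gamma(t) = e^{-t}(\cos t, \sin t)$ is not claimed to be a subgradient curve of any particular function; it is chosen because it winds infinitely often around its limit point while $\|\dot\gamma(t)\| = \sqrt{2}\,e^{-t}$ is integrable, so that its length on $[0, T]$ equals $\sqrt{2}(1 - e^{-T})$.

```python
import math

def gamma(t):
    """Planar spiral gamma(t) = exp(-t) * (cos t, sin t) (illustrative curve)."""
    return (math.exp(-t) * math.cos(t), math.exp(-t) * math.sin(t))

def polyline_length(curve, a, b, n=20000):
    """Length of the polygonal line through n + 1 equally spaced samples of the curve."""
    total = 0.0
    previous = curve(a)
    for i in range(1, n + 1):
        current = curve(a + (b - a) * i / n)
        total += math.hypot(current[0] - previous[0], current[1] - previous[1])
        previous = current
    return total

T = 20.0
approx_length = polyline_length(gamma, 0.0, T)
exact_length = math.sqrt(2.0) * (1.0 - math.exp(-T))
print(approx_length, exact_length)
```

The example shows that a curve may oscillate indefinitely and still have finite length, which is the quantity controlled by the KL-inequality.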

3.3 Characterizations of the KL-inequality 

In this section we state and prove one of the main results of this work. Let $f : H \to \mathbb{R} \cup \{+\infty\}$
and $\bar x \in [f = 0]$ be a critical point. Throughout this section the following assumptions will be
used:

- There exist $\bar r, \varepsilon > 0$ such that

$$x \in \bar B(\bar x, \varepsilon) \cap [0 < f \le \bar r] \implies 0 \notin \partial f(x) \quad \text{($0$ is a locally upper isolated critical value).} \tag{23}$$

- There exist $\bar r, \varepsilon > 0$ such that

$$\bar B(\bar x, \varepsilon) \cap [f \le \bar r] \text{ is (norm) compact} \quad \text{(local sublevel compactness).} \tag{24}$$

Remark 16. 

(i) The first condition can be seen as a Sard-type condition. 

(ii) Assumption (24) is always satisfied in finite-dimensional spaces, but is also satisfied in several interesting cases involving infinite-dimensional spaces. Here are two elementary examples.

(ii)$_1$ The (convex) function $f : \ell^2(\mathbb{N}) \to \mathbb{R}$ defined by

$$f(x) = \sum_{n \ge 1} n^2 x_n^2$$

has compact lower level sets.

(ii)$_2$ Let $g : \mathbb{R} \to \mathbb{R} \cup \{+\infty\}$ be a proper lower semicontinuous semiconvex function and let $\Phi : L^2(\Omega) \to \mathbb{R} \cup \{+\infty\}$ be as follows ([10])

$$\Phi(x) = \begin{cases} \frac{1}{2}\int_\Omega \|\nabla x\|^2 + \int_\Omega g(x) & \text{if } x \in H^1(\Omega), \\ +\infty & \text{otherwise.} \end{cases}$$

The above function is a lower semicontinuous semiconvex function and the sets of the form $[\Phi \le r] \cap \bar B(x, R)$ are relatively compact in $L^2(\Omega)$ (use the compact embedding theorem $H^1(\Omega) \hookrightarrow L^2(\Omega)$).

As shown in Theorem 18, the Kurdyka-Lojasiewicz inequality can be characterized in terms of boundedness of the length of "worst (piecewise absolutely continuous) curves", that is, those defined by the points of least steep descent.

Definition 17 (Talweg/Valley). Let $\bar x \in [f = 0]$ be a critical point of $f$ and assume that (23) holds for some $\bar r, \bar\epsilon > 0$. Let $D$ be any closed bounded set that contains $\bar B(\bar x, \bar\epsilon) \cap [0 < f \le \bar r]$. For any $R > 1$ the $R$-valley $V_R(\cdot)$ of $f$ around $\bar x$ is defined as follows:

$$V_R(r) = \Big\{ x \in [f = r] \cap D : \|\partial f(x)\|_- \le R \inf_{y \in [f = r] \cap D} \|\partial f(y)\|_- \Big\}, \quad \text{for all } r \in (0, \bar r]. \tag{25}$$

A selection $\theta : (0, \bar r] \to H$ of $V_R$, i.e. a curve such that $\theta(r) \in V_R(r)$ for all $r \in (0, \bar r]$, is called an $R$-talweg or simply a talweg.

We are ready to state the main result of this work. 

Theorem 18 (Subgradient inequality - local characterization). Let $f : H \to \mathbb{R} \cup \{+\infty\}$ be a lower semicontinuous semiconvex function and $\bar x \in [f = 0]$ be a critical point. Assume that there exist $\bar\epsilon, \bar r > 0$ such that (23) and (24) hold.

Then, the following statements are equivalent:

(i) [Kurdyka-Lojasiewicz inequality] There exist $r_0 \in (0, \bar r)$, $\epsilon \in (0, \bar\epsilon)$ and $\varphi \in \mathcal{K}(0, r_0)$ such that

$$\|\partial(\varphi \circ f)(x)\|_- \ge 1, \quad \text{for all } x \in \bar B(\bar x, \epsilon) \cap [0 < f \le r_0]. \tag{26}$$

(ii) [Length boundedness of subgradient curves] There exist $r_0 \in (0, \bar r)$, $\epsilon \in (0, \bar\epsilon)$ and a strictly increasing continuous function $\sigma : [0, r_0] \to [0, +\infty)$ with $\sigma(0) = 0$ such that for all subgradient curves $\chi_x$ of (20) satisfying $\chi_x([0,T)) \subset \bar B(\bar x, \epsilon) \cap [0 < f \le r_0]$ ($T \in (0, +\infty]$) we have

$$\int_0^T \|\dot\chi_x(t)\|\,dt \le \sigma(f(x)) - \sigma(f(\chi_x(T))).$$

(iii) [Piecewise subgradient curves have finite length] There exist $r_0 \in (0, \bar r)$, $\epsilon \in (0, \bar\epsilon)$ and $M > 0$ such that for all piecewise subgradient curves $\gamma : [0,T) \to H$ of (20) satisfying $\gamma([0,T)) \subset \bar B(\bar x, \epsilon) \cap [0 < f \le r_0]$ ($T \in (0, +\infty]$) we have

$$\mathrm{length}[\gamma] := \int_0^T \|\dot\gamma(\tau)\|\,d\tau < M.$$

(iv) [Talwegs of finite length] For every $R > 1$, there exist $r_0 \in (0, \bar r)$, $\epsilon \in (0, \bar\epsilon)$, a closed bounded subset $D$ containing $\bar B(\bar x, \epsilon) \cap [0 < f \le r_0]$ and a piecewise absolutely continuous curve $\theta : (0, r_0] \to H$ of finite length which is a selection of the valley $V_R(r)$, that is,

$$\theta(r) \in V_R(r), \quad \text{for all } r \in (0, r_0].$$

(v) [Integrability condition] There exist $r_0 \in (0, \bar r)$ and $\epsilon \in (0, \bar\epsilon)$ such that the function

$$u(r) = \frac{1}{\inf_{x \in \bar B(\bar x, \epsilon) \cap [f = r]} \|\partial f(x)\|_-}, \quad r \in (0, r_0],$$

is finite-valued and belongs to $L^1(0, r_0)$.

Remark 19.

(i) As it appears clearly in the proof, statement (iv) can be replaced by (iv') "There exist $R > 1$, $r_0 \in (0, \bar r)$, $\epsilon \in (0, \bar\epsilon)$, a closed bounded subset $D$ containing $\bar B(\bar x, \epsilon) \cap [0 < f \le r_0]$ and a piecewise absolutely continuous curve $\theta : (0, r_0] \to H$ of finite length which is a selection of the valley $V_R(r)$, that is,

$$\theta(r) \in V_R(r), \quad \text{for all } r \in (0, r_0]\text{"}.$$

(ii) The compactness assumption (24) is only used in the proofs of (iii) $\Rightarrow$ (ii) and (ii) $\Rightarrow$ (iv). Hence if this assumption is removed, we still have:

(iv) $\Rightarrow$ (iv') $\Rightarrow$ (v) $\Leftrightarrow$ (i) $\Rightarrow$ (ii) $\Rightarrow$ (iii).

(iii) Note that (i) implies condition (23). This follows immediately from the chain rule (see Annex, Lemma 43).

Proof of Theorem 18. [(i) $\Rightarrow$ (ii)] Let $\epsilon, r_0, \varphi$ be as in (i) such that (26) holds. Let further $\chi_x$ be a subgradient curve of (20) for $x \in [0 < f \le r_0]$ and assume that $\chi_x([0,T)) \subset \bar B(\bar x, \epsilon) \cap [0 < f \le r_0]$ for some $T > 0$.

Let us first assume that $x \in \mathrm{dom}\,\partial f$. Since $\varphi$ is $C^1$ on $(0, r_0)$, by Theorem 13(v) and Lemma 43 (Annex) we deduce that the curve $t \mapsto \varphi(f(\chi_x(t)))$ is absolutely continuous with derivative

$$\frac{d}{dt}(\varphi \circ f \circ \chi_x)(t) = -\varphi'(f(\chi_x(t)))\,\|\dot\chi_x(t)\|^2 \quad \text{a.e. on } (0,T).$$

Integrating both terms on the interval $(0,T)$ and recalling (26) and $\chi_x(0) = x$, we get

$$\varphi(f(x)) - \varphi(f(\chi_x(T))) = -\int_0^T \frac{d}{dt}(\varphi \circ f \circ \chi_x)(t)\,dt = \int_0^T \varphi'(f(\chi_x(t)))\,\|\dot\chi_x(t)\|^2\,dt \ge \int_0^T \|\dot\chi_x(t)\|\,dt.$$

Thus (ii) holds true for $\sigma := \varphi$ and for all subgradient curves starting from points in $\mathrm{dom}\,\partial f$. Let now $x \in \mathrm{dom}\,f \setminus \mathrm{dom}\,\partial f$ and fix any $\delta \in (0,T)$. Since $\chi_x([\delta, T]) \subset \mathrm{dom}\,\partial f$ we deduce from the above that

$$\int_\delta^T \|\dot\chi_x(t)\|\,dt \le \sigma(f(\chi_x(\delta))) - \sigma(f(\chi_x(T))).$$

Thus the result follows by letting $\delta \searrow 0^+$ and using the continuity of the mapping $t \mapsto f(\chi_x(t))$ at $0$ (Theorem 13(ii)).

[(ii) $\Rightarrow$ (iii)] Let $\gamma$ be a piecewise subgradient curve as in (iii) and let $I_k$ be the associated partition of $[0,T]$ (cf. Definition 15). Let $\{a_k\}$ and $\{b_k\}$ be two sequences of real numbers such that $\mathrm{int}\,I_k = (a_k, b_k)$. Since the restriction $\gamma|_{I_k}$ of $\gamma$ to $I_k$ is a subgradient curve, applying (ii) on $(a_k, b_k)$ we get

$$\mathrm{length}[\gamma|_{I_k}] \le \sigma(f(\gamma(a_k))) - \sigma(f(\gamma(b_k))).$$

Let $m$ be an integer and $I_{k_1}, \dots, I_{k_m}$ a finite subfamily of the partition. We may assume that these intervals are ordered as follows: $0 \le a_{k_1} < b_{k_1} \le \dots \le a_{k_m} < b_{k_m}$. Hence

$$\sum_{i=1}^{m} \big[\sigma(f(\gamma(a_{k_i}))) - \sigma(f(\gamma(b_{k_i})))\big] \le \sigma(f(\gamma(a_{k_1}))) \le \sigma(r_0).$$

Thus the family $\{\sigma(f(\gamma(a_k))) - \sigma(f(\gamma(b_k)))\}$ is summable, hence using the definition of the Bochner integral (see [11])

$$\mathrm{length}[\gamma] = \sum_{k \in \mathbb{N}} \mathrm{length}[\gamma|_{I_k}] \le \sigma(r_0).$$

[(iii) $\Rightarrow$ (ii)] Let $\epsilon, r_0$ be as in (iii), pick any $0 \le r' < r \le r_0$ and denote by $\Gamma_{r',r}$ the (nonempty) set of piecewise subgradient curves $\gamma : [0,T) \to H$ (where $T \in (0, +\infty]$) such that

$$\gamma([0,T)) \subset \bar B(\bar x, \epsilon) \cap [r' < f \le r].$$

Note that, by Theorem 13(iv) and Proposition 41(iii), $T = +\infty$ is possible only when $r' = 0$. Set further

$$\psi(r', r) := \sup_{\gamma \in \Gamma_{r',r}} \mathrm{length}[\gamma] \quad \text{and} \quad \sigma(r) := \psi(0, r).$$

Note that (iii) guarantees that $\psi$ and $\sigma$ have finite values. We can easily deduce from Definition 15 that

$$\psi(0, r') + \psi(r', r) = \psi(0, r). \tag{27}$$

Thus for each $x \in \bar B(\bar x, \epsilon) \cap [0 < f \le r_0]$ and $T > 0$ such that $\chi_x([0,T]) \subset \bar B(\bar x, \epsilon) \cap [0 < f \le r_0]$, we have

$$\int_0^T \|\dot\chi_x(\tau)\|\,d\tau + \sigma(f(\chi_x(T))) \le \sigma(f(x)). \tag{28}$$

Since the function $\sigma$ is nonnegative and increasing it can be extended continuously at $0$ by setting $\sigma(0) = \lim_{t \searrow 0} \sigma(t) \ge 0$. Since the property (28) remains valid if we replace $\sigma(\cdot)$ by $\sigma(\cdot) - \sigma(0)$, there is no loss of generality in assuming $\sigma(0) = 0$.

To conclude it suffices to establish the continuity of $\sigma$ on $(0, r_0]$. Fix $\tilde r$ in $(0, r_0)$ and take a subgradient curve $\chi : [0,T) \to H$ satisfying $\chi([0,T)) \subset \bar B(\bar x, \epsilon) \cap [\tilde r < f \le r_0]$, where $T \in (0, +\infty]$. Set $f(\chi(0)) = r$ and $\lim_{t \to T} f(\chi(t)) = r'$ and assume that $\tilde r < r' < r \le r_0$.

From Theorem 13(iv) and Proposition 41(iii) (Annex), we deduce that $T < +\infty$ so that $\chi([0,T]) \subset \bar B(\bar x, \epsilon) \cap [r' \le f \le r]$. Using assumption (23) together with Theorem 13(i),(v), we deduce that the absolutely continuous function $f \circ \chi : [0,T] \to [r', r]$ is invertible and

$$\frac{d}{d\rho}[f \circ \chi]^{-1}(\rho) = \frac{-1}{\|\dot\chi([f \circ \chi]^{-1}(\rho))\|^2} \ge \frac{-1}{\inf_{x \in \bar B(\bar x, \epsilon) \cap [\tilde r \le f \le r_0]} \|\partial f(x)\|_-^2} =: -K, \tag{29}$$

for almost all $\rho \in (r', r)$. By Proposition 41(iii) (Annex) we get that $K < +\infty$ and therefore the function $\rho \mapsto [f \circ \chi]^{-1}(\rho)$ is Lipschitz continuous with constant $K$ on $[r', r]$. Using the Cauchy-Schwarz inequality and Theorem 13(iv) we obtain

$$\mathrm{length}[\chi] = \int_0^T \|\dot\chi\| \le \sqrt{T}\left(\int_0^T \|\dot\chi\|^2\right)^{1/2} = \sqrt{[f \circ \chi]^{-1}(r') - [f \circ \chi]^{-1}(r)}\,\sqrt{r - r'} \le \sqrt{K(r - r')}\,\sqrt{r - r'} = \sqrt{K}\,(r - r').$$

This last inequality implies that each piecewise subgradient curve $\gamma : [0,T) \to H$ such that $\gamma([0,T)) \subset \bar B(\bar x, \epsilon) \cap [r' < f \le r]$ satisfies

$$\mathrm{length}[\gamma] \le \sqrt{K}\,(r - r'),$$

thus using (27) we obtain $\sigma(r) - \sigma(r') \le \sqrt{K}\,(r - r')$, which yields the continuity of $\sigma$.

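[(ii) $\Rightarrow$ (iv)] Let us assume that (ii) holds true for $\epsilon$ and $r_0$. In a first step we establish the existence of a closed bounded subset $D$ of $[0 < f \le r_0]$ satisfying

$$x \in D,\ t \ge 0,\ f(\chi_x(t)) > 0 \implies \chi_x(t) \in D. \tag{30}$$

Let $r_0 > r_1 > 0$ be such that $\sigma(r_1) < \epsilon/3$ and let us set

$$D := \{ y \in \bar B(\bar x, \epsilon) \cap [0 < f \le r_1] : \exists x \in \bar B(\bar x, \epsilon/3) \cap [0 < f \le r_1],\ \exists t \ge 0 \text{ such that } \chi_x(t) = y \}.$$

Let us first show that $D$ enjoys property (30). It suffices to establish that

$$x \in \bar B(\bar x, \epsilon/3) \cap [0 < f \le r_1],\ t \ge 0,\ f(\chi_x(t)) > 0 \implies \chi_x(t) \in D.$$

To this end, fix $x \in \bar B(\bar x, \epsilon/3) \cap [0 < f \le r_1]$. By continuity of the flow, we observe that $\chi_x(t) \in \bar B(\bar x, \epsilon)$ for small $t > 0$, and for all $t \ge 0$ such that $\chi_x([0,t]) \subset \bar B(\bar x, \epsilon)$ with $f(\chi_x(t)) > 0$, assumption (ii) yields

$$\|\chi_x(t) - \bar x\| \le \|\chi_x(t) - x\| + \|x - \bar x\| \le \int_0^t \|\dot\chi_x(\tau)\|\,d\tau + \epsilon/3 \le \sigma(r_1) + \epsilon/3 \le 2\epsilon/3. \tag{31}$$

Thus $D$ satisfies (30) and $\bar B(\bar x, \epsilon/3) \cap [0 < f \le r_1] \subset D$.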

Let us now prove that $D$ is (relatively) closed in $[0 < f \le r_1]$. Let $\{y_n\}_n \subset D$ be a sequence converging to $y$ such that $f(y) \in (0, r_1]$. Then there exist sequences $\{x_n\}_n \subset \bar B(\bar x, \epsilon/3) \cap [0 < f \le r_1]$ and $\{t_n\}_n \subset \mathbb{R}_+$ such that $\chi_{x_n}(t_n) = y_n$. Since $f$ is lower semicontinuous, there exist $n_0 \in \mathbb{N}$ and $\eta > 0$ such that $f(y_n) > \eta$ for all $n \ge n_0$. By Theorem 13(ii),(iv), (23) and Proposition 41(iii) (Annex), we obtain for all $n \ge n_0$

$$0 < t_n \inf_{z \in \bar B(\bar x, \epsilon) \cap [\eta \le f \le r_1]} \|\partial f(z)\|_-^2 \le \int_0^{t_n} \|\dot\chi_{x_n}(t)\|^2\,dt \le f(x_n) \le r_1.$$

The above inequality shows that the sequence $\{t_n\}_n$ is bounded. Using a standard compactness argument we therefore deduce that, up to an extraction, $x_n \to x$ and $t_n \to t$ for some $x \in \bar B(\bar x, \epsilon/3) \cap [f \le r_1]$ and $t \in \mathbb{R}_+$. Theorem 14 (continuity of the semiflow) implies that $y = \chi_x(t)$ and consequently that $f(x) \ge f(y) > 0$, yielding that $y \in D$. This shows that $D$ is (relatively) closed in $[0 < f \le r_0]$.

Now we build a piecewise absolutely continuous curve in the valley. According to the notation of Proposition 41 (Annex) we set

$$s_D(r) := \inf\{\|\partial f(x)\|_- : x \in D \cap [f = r]\},$$

so that for any $R > 1$ the $R$-valley around $\bar x$ (cf. Definition 17) is given by

$$V_R(r) := \{x \in [f = r] \cap D : \|\partial f(x)\|_- \le R\,s_D(r)\}.$$

If $\bar B(\bar x, \epsilon/3) \cap [f = r] = \emptyset$ for all $0 < r \le r_1$, there is nothing to prove. Otherwise, there exist $0 < r_2 \le r_1$ and $x_2 \in \bar B(\bar x, \epsilon/3) \cap [f = r_2] \subset D$. From Theorem 13 and Proposition 41(iii) (Annex), we deduce that $\chi_{x_2}(t) \in [f = f(\chi_{x_2}(t))] \cap D \cap \mathrm{dom}\,\partial f$ for all $t > 0$ such that $[f \circ \chi_{x_2}](t) > 0$ and that the inverse function $[f \circ \chi_{x_2}]^{-1}(\cdot)$ is defined on an interval containing $(0, r_2)$. In other words the set $[f = r] \cap D \cap \mathrm{dom}\,\partial f$ is nonempty for each $r \in (0, r_2)$, which in turn implies that the valley is nonempty for small positive values of $r$, i.e. $V_R(r) \ne \emptyset$ for all $r \in (0, r_2)$. With no loss of generality we assume that $V_R(r_2) \ne \emptyset$.

Let further $R' \in (1, R)$ and $x \in [f = r_2] \cap D$ be such that $\|\partial f(x)\|_- \le R'\,s_D(r_2)$ (therefore, in particular, $x \in V_R(r_2)$). Take $\rho \in (R', R)$. Since the mapping $t \mapsto \|\partial f(\chi_x(t))\|_-$ is right-continuous (cf. Theorem 13(iii)), there exists $t_0 > 0$ such that $\|\partial f(\chi_x(t))\|_- < \rho\,s_D(r_2)$ for all $t \in (0, t_0)$. On the other hand $t \mapsto s_D(f(\chi_x(t)))$ is lower semicontinuous (cf. Proposition 41, Annex), hence there exists $t_1 \in (0, t_0)$ such that $R\,s_D(f(\chi_x(t))) > \rho\,s_D(r_2)$ for all $t \in (0, t_1)$. Using the continuity of the mapping $\chi_x(\cdot)$ and the stability property (30), we obtain the existence of $t_2 > 0$ such that

$$\chi_x(t) \in V_R(f(\chi_x(t))) \quad \text{for all } t \in [0, t_2). \tag{32}$$

By using arguments similar to those of [(iii) $\Rightarrow$ (ii)] we define the following absolutely continuous curve:

$$(f \circ \chi_x(t_2), r_2] \ni r \mapsto \theta(r) = \chi_x([f \circ \chi_x]^{-1}(r)) \in D \cap [f = r].$$

By Proposition 46, based on Zorn's Lemma (see Annex), we obtain a piecewise subgradient curve that we still denote by $\theta$, defined on $(0, r_2]$, satisfying $\theta(r) \in V_R(r)$ for all $r \in (0, r_2]$. Assumption (iii) now yields

$$\mathrm{length}[\theta] < M < +\infty,$$
completing the proof of the assertion.

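[(iv) $\Rightarrow$ (v)] Fix $R > 1$ and let $\epsilon, r_0$ and $\theta : (0, r_0] \to H$ be as in (iv). Applying Lemma 43 (Annex), we get

$$\frac{d}{dr}(f \circ \theta)(r) = 1 = \langle \dot\theta(r), p(r) \rangle \quad \text{a.e. on } (0, r_0], \text{ for all } p(r) \in \partial f(\theta(r)).$$

Using the Cauchy-Schwarz inequality together with the fact that $D \cap [f = r] \supset \bar B(\bar x, \epsilon) \cap [f = r]$, we obtain

$$R\,\|\dot\theta(r)\| \ge u(r) = \frac{1}{\inf_{x \in \bar B(\bar x, \epsilon) \cap [f = r]} \|\partial f(x)\|_-},$$

for almost all $r \in (0, r_0]$. Since $\theta$ has finite length we deduce that $u \in L^1(0, r_0)$.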

[(v) $\Rightarrow$ (i)] Let $\epsilon, r_0$ and $u$ be as in (v). From Proposition 41 (Annex) we deduce that $u$ is finite-valued and upper semicontinuous. Applying Lemma 44 (Annex) we obtain a continuous function $\bar u : (0, r_0] \to (0, +\infty)$ such that $\bar u(r) \ge u(r)$ for all $r \in (0, r_0]$. We set

$$\varphi(r) = \int_0^r \bar u(s)\,ds.$$

It is directly seen that $\varphi(0) = 0$, $\varphi \in C([0, r_0]) \cap C^1(0, r_0)$ and $\varphi'(r) > 0$ for all $r \in (0, r_0)$. Let $x \in \bar B(\bar x, \epsilon) \cap [f = r]$ and $q \in \partial(\varphi \circ f)(x)$. From Lemma 43 (Annex) we deduce $p := q/\varphi'(r) \in \partial f(x)$, and therefore

$$\|q\| = \varphi'(r)\,\Big\|\frac{q}{\varphi'(r)}\Big\| \ge u(r)\,\|p\| \ge 1.$$

The proof is complete. □
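As an illustration of the construction used in (v) $\Rightarrow$ (i), the following minimal sketch works out a one-dimensional example that is not taken from the paper: $f(x) = |x|^p/p$ on the real line with $p > 1$ (an assumed toy function). On the level set $[f = r]$ one has $|f'(x)| = (pr)^{(p-1)/p}$, so $u(r) = (pr)^{-(p-1)/p}$ is integrable near $0$ and $\varphi(r) = \int_0^r u = (pr)^{1/p}$. The helper names `u`, `phi` and `kl_quantity` are hypothetical.

```python
def u(r, p):
    """u(r) = 1 / inf{|f'(x)| : f(x) = r} for the toy function f(x) = |x|**p / p."""
    # On [f = r] we have |x| = (p*r)**(1/p), hence |f'(x)| = |x|**(p-1).
    return (p * r) ** (-(p - 1) / p)


def phi(r, p):
    """Closed form of the integral of u over (0, r); here we take u-bar = u."""
    return (p * r) ** (1.0 / p)


def kl_quantity(x, p):
    """phi'(f(x)) * |f'(x)|, the quantity that (26) requires to be >= 1."""
    fx = abs(x) ** p / p
    return u(fx, p) * abs(x) ** (p - 1)  # phi' = u in this example


if __name__ == "__main__":
    for p in (2, 4):
        for x in (0.1, 0.5, 2.0):
            print(p, x, kl_quantity(x, p))
```

In this example the desingularizing function reduces to $\varphi(f(x)) = |x|$, the distance to the minimizer, and the inequality holds with equality; this is specific to the one-dimensional setting.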
Under a stronger compactness assumption Theorem 18 can be reformulated as follows.

Theorem 20 (Subgradient inequality - global characterization). Let $f : H \to \mathbb{R} \cup \{+\infty\}$ be a lower semicontinuous semiconvex function. Assume that there exists $r_0 > 0$ such that

$$[f \le r_0] \text{ is compact and } 0 \notin \partial f(x), \quad \forall x \in [0 < f < r_0].$$

Then the following propositions are equivalent:

(i) [Kurdyka-Lojasiewicz inequality] There exists $\varphi \in \mathcal{K}(0, r_0)$ such that

$$\|\partial(\varphi \circ f)(x)\|_- \ge 1, \quad \text{for all } x \in [0 < f < r_0].$$

(ii) [Length boundedness of subgradient curves] There exists an increasing continuous function $\sigma : [0, r_0) \to [0, +\infty)$ with $\sigma(0) = 0$ such that for all subgradient curves $\chi_x(\cdot)$ (where $x \in [0 < f < r_0]$) we have

$$\int_0^T \|\dot\chi_x(t)\|\,dt \le \sigma(f(x)) - \sigma(f(\chi_x(T))),$$

whenever $f(\chi_x(T)) > 0$.

(iii) [Piecewise subgradient curves have bounded length] There exists $M > 0$ such that for all piecewise subgradient curves $\gamma : [0,T) \to H$ such that $\gamma([0,T)) \subset [0 < f < r_0]$ we have

$$\mathrm{length}[\gamma] < M.$$

(iv) [Talwegs of finite length] For all $R > 1$, there exists a piecewise absolutely continuous curve (with countable pieces) $\theta : (0, r_0) \to \mathbb{R}^n$ with finite length such that

$$\theta(r) \in \Big\{ x \in [f = r] : \|\partial f(x)\|_- \le R \inf_{y \in [f = r]} \|\partial f(y)\|_- \Big\}, \quad \text{for all } r \in (0, r_0).$$

(v) [Integrability condition] The function $u : (0, r_0) \to [0, +\infty]$ defined by

$$u(r) = \frac{1}{\inf_{x \in [f = r]} \|\partial f(x)\|_-}$$

is finite-valued and belongs to $L^1(0, r_0)$.

(vi) [Lipschitz continuity of the sublevel mapping] There exists $\varphi \in \mathcal{K}(0, r_0)$ such that

$$\mathrm{Dist}([f \le r], [f \le s]) \le |\varphi(r) - \varphi(s)| \quad \text{for all } r, s \in (0, r_0).$$

Proof The proof is similar to the proof of Theorem [18] and will be omitted. The equivalence 
between (i) and (vi) is a consequence of Corollary [H □ 

3.4 Application: convergence of the proximal algorithm 

In this subsection we assume that the function $f : H \to \mathbb{R} \cup \{+\infty\}$ is semiconvex (cf. Definition 10). Let us recall the definition of the proximal mapping (see [42, Definition 1.22], for example).

Definition 21 (proximal mapping). Let $\lambda \in (0, \alpha^{-1})$. Then the proximal mapping $\mathrm{prox}_\lambda : H \to H$ is defined by

$$\mathrm{prox}_\lambda(x) := \mathrm{argmin}\Big\{ f(y) + \frac{1}{2\lambda}\|y - x\|^2 \Big\}, \quad \forall x \in H.$$

Remark 22. The fact that $\mathrm{prox}_\lambda$ is well-defined and single-valued is a consequence of the semiconvexity assumption: indeed this assumption implies that the auxiliary function appearing in the aforementioned definition is strictly convex and coercive (see [42], [14] for instance).

Lemma 23 (Subgradient inequality and proximal mapping). Assume that $f : H \to \mathbb{R} \cup \{+\infty\}$ is a semiconvex function that satisfies (i) of Theorem 20. Let $x \in [0 < f < r_0]$ be such that $f(\mathrm{prox}_\lambda x) > 0$. Then

$$\|\mathrm{prox}_\lambda x - x\| \le \varphi(f(x)) - \varphi(f(\mathrm{prox}_\lambda x)). \tag{33}$$

Proof. Set $x^+ = \mathrm{prox}_\lambda(x)$, $r = f(x)$, and $r^+ = f(x^+)$. It follows from the definition of $x^+$ that $0 < r^+ \le r < r_0$. In particular, for every $u \in [f \le r^+]$ we have

$$\|x^+ - x\|^2 \le \|u - x\|^2 + 2\lambda[f(u) - r^+] \le \|u - x\|^2.$$

Therefore by Corollary U] (Lipschitz continuity of the sublevel mapping) we obtain 

$$\|x^+ - x\| = \mathrm{dist}(x, [f \le r^+]) \le \mathrm{Dist}([f \le r], [f \le r^+]) \le \varphi(r) - \varphi(r^+).$$

The proof is complete. □ 

The above result has an important impact on the asymptotic analysis of the proximal algorithm (see forthcoming Theorem 24). Let us first recall that, given a sequence of positive parameters $\{\lambda_k\} \subset (0, \alpha^{-1})$ and $x \in H$, the proximal algorithm is defined as follows:

$$Y_x^{k+1} = \mathrm{prox}_{\lambda_k} Y_x^k, \quad Y_x^0 = x,$$

or in other words

$$\{Y_x^{k+1}\} = \mathrm{argmin}\Big\{ f(u) + \frac{1}{2\lambda_k}\|u - Y_x^k\|^2 \Big\}, \quad Y_x^0 = x.$$

If we assume in addition that $\inf f > -\infty$, then for any initial point $x$ the sequence $\{f(Y_x^k)\}$ is decreasing and converges to a real number $L_x$.
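The recursion above can be sketched on a toy example that is not taken from the paper: the convex quadratic $f(x, y) = \frac{1}{2}(x^2 + 4y^2)$ on $\mathbb{R}^2$, for which the proximal point has a closed form (obtained by setting the gradient of $u \mapsto f(u) + \frac{1}{2\lambda}\|u - v\|^2$ to zero). The function names and the constant step size are assumptions made for illustration only.

```python
def f(v):
    """Toy convex function f(x, y) = (x**2 + 4*y**2) / 2 (an assumption)."""
    x, y = v
    return 0.5 * (x * x + 4.0 * y * y)


def prox(v, lam):
    """Closed-form minimizer of u -> f(u) + |u - v|**2 / (2*lam) for this f."""
    x, y = v
    return (x / (1.0 + lam), y / (1.0 + 4.0 * lam))


def proximal_algorithm(v0, steps):
    """Return the iterates Y^0, Y^1, ... for the step sizes in `steps`."""
    iterates = [v0]
    for lam in steps:
        iterates.append(prox(iterates[-1], lam))
    return iterates


if __name__ == "__main__":
    ys = proximal_algorithm((1.0, 1.0), [0.5] * 10)
    for k, y in enumerate(ys):
        print(k, y, f(y))
```

Since this $f$ is convex, any positive step size is admissible; the printed values of $f$ decrease along the iterates, as stated in the text.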

Theorem 24 (strong convergence of the proximal algorithm). Let $f : H \to \mathbb{R} \cup \{+\infty\}$ be a semiconvex function which is bounded from below. Let $x \in \mathrm{dom}\,f$, $\{\lambda_k\} \subset (0, \alpha^{-1})$ and $L_x := \lim_{k \to \infty} f(Y_x^k)$, and assume that there exist $k_0 \ge 0$ and $\varphi \in \mathcal{K}(0, f(Y_x^{k_0}) - L_x)$ such that

$$\|\partial(\varphi \circ [f(\cdot) - L_x])(x)\|_- \ge 1, \quad \text{for all } x \in [L_x < f \le f(Y_x^{k_0})]. \tag{34}$$

Then the sequence $\{Y_x^k\}$ converges strongly to $Y_x^\infty$ and

$$\|Y_x^\infty - Y_x^k\| \le \varphi(f(Y_x^k) - L_x), \quad \text{for all } k \ge k_0. \tag{35}$$

Proof. Since the sequence $\{Y_x^k\}_{k \ge k_0}$ evolves in $[L_x < f \le f(Y_x^{k_0})]$, Lemma 23 applies. This yields

$$\sum_{k=p}^{q} \|Y_x^{k+1} - Y_x^k\| \le \varphi(f(Y_x^p) - L_x) - \varphi(f(Y_x^{q+1}) - L_x),$$

for all integers $k_0 \le p \le q$. This implies that $Y_x^k$ converges strongly to $Y_x^\infty$ and that inequality (35) holds. □
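The estimates (33) and (35) can be checked numerically on a toy example that is not taken from the paper. The sketch below assumes $f(x, y) = \frac{1}{2}(x^2 + 4y^2)$ on $\mathbb{R}^2$, for which $\|\nabla f\|^2 = x^2 + 16y^2 \ge 2f$, so that $\varphi(r) = \sqrt{2r}$ satisfies the KL-inequality; here $L_x = 0$ and the limit point is the origin. All names are hypothetical.

```python
import math


def f(v):
    """Toy convex function (an assumption): f(x, y) = (x**2 + 4*y**2) / 2."""
    return 0.5 * (v[0] ** 2 + 4.0 * v[1] ** 2)


def prox(v, lam):
    """Closed-form proximal point of f at v with parameter lam."""
    return (v[0] / (1.0 + lam), v[1] / (1.0 + 4.0 * lam))


def phi(r):
    """Desingularizing function for this f: phi'(f) * |grad f| >= 1."""
    return math.sqrt(2.0 * r)


def dist(a, b):
    return math.hypot(a[0] - b[0], a[1] - b[1])


def check_estimates(v0, steps):
    """Return (lhs, rhs) pairs for (33) at each step and for (35) at each iterate."""
    ys = [v0]
    for lam in steps:
        ys.append(prox(ys[-1], lam))
    limit = (0.0, 0.0)  # minimizer of f, limit of the iterates
    step_pairs = [(dist(ys[k + 1], ys[k]), phi(f(ys[k])) - phi(f(ys[k + 1])))
                  for k in range(len(steps))]
    tail_pairs = [(dist(limit, y), phi(f(y))) for y in ys]
    return step_pairs, tail_pairs


if __name__ == "__main__":
    steps, tails = check_estimates((1.0, 1.0), [0.5, 0.1, 2.0, 0.3] * 5)
    print(all(l <= r + 1e-12 for l, r in steps), all(l <= r + 1e-12 for l, r in tails))
```

The step sizes are deliberately irregular: in line with Remark 25, they do not enter the right-hand sides of the two estimates except through the values $f(Y_x^k)$.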



Remark 25 (Step-size). "Surprisingly" enough the step-size sequence $\{\lambda_k\}$ does not appear explicitly in the estimate (35), but it is instead hidden in the sequence of values $\{f(Y_x^k)\}$. In practice the choice of the step-size parameters $\lambda_k$ is however crucial to obtain the convergence of $\{f(Y_x^k)\}$ to a critical value; standard choices are for example sequences satisfying $\sum \lambda_k = +\infty$ or $\lambda_k \in [\eta, \alpha^{-1})$ for all $k \ge 0$ where $\eta \in (0, \alpha^{-1})$, see [14] for more details.



4 Convexity and KL-inequality 

In this section, we assume that $f : H \to \mathbb{R} \cup \{+\infty\}$ is a lower semicontinuous proper convex function such that $\inf f > -\infty$. Changing $f$ into $f - \inf f$, we may assume that $\inf f = 0$. Let us also denote the set of minimizers of $f$ by

$$C := \mathrm{argmin}\,f = [f = 0].$$

When $C$ is nonempty, we may assume with no loss of generality that $0 \in C$.

In this convex setting Theorem 13 can be considerably reinforced; related results are gathered in Section 4.1. We also recall well-known facts ensuring that subgradient curves have finite length and provide a new result in that direction (see Theorem 28). In Section 4.2 we give some conditions which ensure that $f$ satisfies the KL-inequality and we show that the conclusions of Theorem 20 can somehow be globalized. In Section 4.3 we build a counterexample of a $C^2$ convex function in $\mathbb{R}^2$ which does not satisfy the KL-inequality. This counterexample also reveals that the uniform boundedness of the lengths of subgradient curves is a strictly weaker condition than condition (iii) of Theorem 18, which justifies further the introduction of piecewise subgradient curves.

4.1 Lengths of subgradient curves for convex functions 

The following lemma gathers well-known complements to Theorem 13 when $f$ is convex.

Lemma 26. Let $f : H \to \mathbb{R} \cup \{+\infty\}$ be a lower semicontinuous proper convex function such that $0 \in C = [f = 0]$. Let $x_0 \in \mathrm{dom}\,f$.

(i) If $a \in C$, then

$$\frac{d}{dt}\|\chi_{x_0}(t) - a\|^2 \le -2f(\chi_{x_0}(t)) \le 0 \quad \text{a.e. on } (0, +\infty),$$
and therefore $t \mapsto \|\chi_{x_0}(t) - a\|$ is nonincreasing.
(ii) The function $t \mapsto f(\chi_{x_0}(t))$ is nonincreasing and converges to $0 = \min f$ as $t \to +\infty$.
(iii) The function $t \in [0, +\infty) \mapsto \|\partial f(\chi_{x_0}(t))\|_-$ is nonincreasing.

(iv) The function $t \mapsto f(\chi_{x_0}(t))$ is convex and belongs to $L^1([0, +\infty))$; for all $T > 0$,

$$\int_0^T f(\chi_{x_0}(t))\,dt \le \frac{1}{2}\|x_0\|^2 - \frac{1}{2}\|\chi_{x_0}(T)\|^2 \le \frac{1}{2}\|x_0\|^2. \tag{36}$$

(v) For all $T > 0$,

$$\int_0^T \|\dot\chi_{x_0}(t)\|\,dt \le \left(\int_0^{+\infty} f(\chi_{x_0}(t))\,dt\right)^{1/2} (\log T)^{1/2}. \tag{37}$$

Proof. The proofs of these classical properties can be found in [11], [12]. □

R. Bruck established in [12] that subgradient trajectories of convex functions always converge weakly to a minimizer in $C = \mathrm{argmin}\,f$ whenever the latter is nonempty. However, as shown later on by J.-B. Baillon [7], strong convergence does not hold in general.

To the best of our knowledge, the problem of the characterization of length boundedness of subgradient curves for convex functions is still open (see [11, Open problems, p. 167]). In the present framework, the following result of H. Brezis [10], [11] is of particular interest.

Theorem 27 (Uniform boundedness of trajectory lengths p2]). Let f : H — > R U {+00} be a 
lower semicontinuous proper convex function such that $0 \in C = \mathrm{argmin}\,f = [f = 0]$. We assume that $C$ has nonempty interior. Then, for all $x_0 \in \mathrm{dom}\,f$, $\chi_{x_0}(\cdot)$ has finite length. More precisely, if $B(0, \rho) \subset C$, we have, for all $T \ge 0$,

$$\int_0^T \|\dot\chi_{x_0}(t)\|\,dt \le \frac{1}{2\rho}\left(\|x_0\|^2 - \|\chi_{x_0}(T)\|^2\right).$$

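Proof. We assume that $B(0, \rho) \subset C$ for some $\rho > 0$ and consider $x_0 \in \mathrm{dom}\,f \setminus C$ (otherwise there is nothing to prove). Let $t \ge 0$ be such that $\chi_{x_0}(t) \notin C$ and $\dot\chi_{x_0}(t)$ exists. By convexity, we get

$$-\langle \chi_{x_0}(t) - \rho u, \dot\chi_{x_0}(t) \rangle \ge f(\chi_{x_0}(t)) - f(\rho u) > 0$$

for all $u$ in the unit sphere of $H$. Taking $u = -\dot\chi_{x_0}(t)/\|\dot\chi_{x_0}(t)\|$ whenever $\dot\chi_{x_0}(t) \ne 0$, we obtain $-\langle \dot\chi_{x_0}(t), \chi_{x_0}(t) \rangle \ge \rho\,\|\dot\chi_{x_0}(t)\|$. Since $\frac{d}{dt}\frac{1}{2}\|\chi_{x_0}(t)\|^2 = \langle \dot\chi_{x_0}(t), \chi_{x_0}(t) \rangle$, integrating over $(0, T)$ gives
$\int_0^T \|\dot\chi_{x_0}(t)\|\,dt \le \frac{1}{2\rho}\left(\|x_0\|^2 - \|\chi_{x_0}(T)\|^2\right)$. □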
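The length bound of Theorem 27 can be illustrated on a toy example that is not taken from the paper: $f(x) = \frac{1}{2}\,\mathrm{dist}(x, \bar B(0, \rho))^2$, whose set of minimizers is the closed ball of radius $\rho$. Outside the ball the subgradient curve is radial and its norm solves $r'(t) = -(r(t) - \rho)$, so both sides of the estimate are available in closed form. The function names below are hypothetical.

```python
import math


def radius(t, r0, rho):
    """Norm of the trajectory at time t: solution of r' = -(r - rho), r(0) = r0 > rho."""
    return rho + (r0 - rho) * math.exp(-t)


def length(T, r0, rho):
    """Length of the (radial) trajectory on [0, T]."""
    return r0 - radius(T, r0, rho)


def theorem27_bound(T, r0, rho):
    """Right-hand side (|x0|^2 - |chi(T)|^2) / (2*rho) of the estimate."""
    rT = radius(T, r0, rho)
    return (r0 * r0 - rT * rT) / (2.0 * rho)


if __name__ == "__main__":
    r0, rho = 3.0, 1.0
    for T in (0.1, 1.0, 10.0, 100.0):
        print(T, length(T, r0, rho), theorem27_bound(T, r0, rho))
```

In this example the trajectory never reaches $C$ in finite time, yet its total length $r_0 - \rho$ is finite and stays below the bound, because $r_0 + r(T) \ge 2\rho$.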

The following result is an extension of Theorem 27 under the assumption that the vector subspace $\mathrm{span}(C)$ generated by $C$ has codimension 1 in $H$. We denote by $\mathrm{ri}(C)$ the relative interior of $C$ in $\mathrm{span}(C)$.

Theorem 28. Let $f : H \to \mathbb{R} \cup \{+\infty\}$ be a lower semicontinuous proper convex function such that $0 \in C = \mathrm{argmin}\,f = [f = 0]$. Assume that $C$ generates a subspace of codimension 1 and that the relative interior $\mathrm{ri}(C)$ of $C$ in $\mathrm{span}(C)$ is not empty. If $x_0 \in \mathrm{dom}\,f$ is such that $\chi_{x_0}(t)$ converges (strongly) to $a \in \mathrm{ri}(C)$ as $t \to +\infty$, then $\mathrm{length}[\chi_{x_0}] < +\infty$.

Proof. Let us denote by $a$ the limit point of $\chi(t) := \chi_{x_0}(t)$ as $t$ goes to infinity. By assumption $a$ belongs to $\mathrm{ri}(C)$, so that there exists $\delta > 0$ such that $B(a, \delta) \cap \mathrm{span}(C) \subset C$. Let $T > 0$ be such that $\chi(t) \in B(a, \delta)$ for all $t \ge T$. Write $\mathrm{span}(C) = \{x \in H : \langle x, x^* \rangle = 0\}$ with $x^* \in H$. We claim that the function $[T, +\infty) \ni t \mapsto h(t) = \langle x^*, \chi(t) \rangle$ has a constant sign. Let us argue by contradiction and assume that there exist $T \le t_1 < t_2$ such that $h(t_1) < 0 < h(t_2)$. Hence there exists $t_3 \in (t_1, t_2)$ such that $h(t_3) = 0$. Since $\chi(t_3) \in B(a, \delta)$, this implies $\chi(t_3) \in C$ and thus by the uniqueness theorem for subgradient curves (Theorem 13), we have $\chi(t) = \chi(t_3)$ for all $t \ge t_3$, which is a contradiction. Note also that if $h(t_0) = 0$ for some $t_0 \ge T$ then $\chi$ has finite length. Indeed, applying once more Theorem 13 we deduce that $\chi(t) = \chi(t_0)$ for all $t \ge t_0$, hence

$$\mathrm{length}[\chi] = \int_0^{t_0} \|\dot\chi(t)\|\,dt < +\infty.$$

Assume that $h$ is positive (the case $h$ negative can be treated similarly) and define the following function

$$\tilde f(x) = \begin{cases} 0 & \text{if } \langle x, x^* \rangle \le 0 \text{ and } x \in \bar B(a, \delta), \\ f(x) & \text{if } \langle x, x^* \rangle > 0 \text{ and } x \in \bar B(a, \delta), \\ +\infty & \text{otherwise.} \end{cases}$$

One can easily check that $\tilde f$ is proper, lower semicontinuous, convex and that $\mathrm{argmin}\,\tilde f$ has nonempty interior. Note also that $\partial \tilde f(x) = \partial f(x)$ for all $x \in B(a, \delta)$ such that $\langle x, x^* \rangle > 0$. The conclusion follows from the previous result and the fact that $\dot\chi(t) + \partial \tilde f(\chi(t)) \ni 0$ a.e. on $(T, +\infty)$. □



4.2 KL-inequality for convex functions 

The following result shows that if $f$ is convex, then the function $\varphi$ of Theorem 18(i) can be assumed to be concave and defined on $[0, \infty)$.

Theorem 29 (Subgradient inequality - convex case). Let $f : H \to \mathbb{R} \cup \{+\infty\}$ be a lower semicontinuous proper convex function which is bounded from below (recall that $\inf f = 0$). The following statements are equivalent:

(i) There exist $r_0 > 0$ and $\varphi \in \mathcal{K}(0, r_0)$ such that

$$\|\partial(\varphi \circ f)(x)\|_- \ge 1, \quad \text{for all } x \in [0 < f \le r_0].$$

(ii) There exists a concave function $\psi \in \mathcal{K}(0, \infty)$ such that

$$\|\partial(\psi \circ f)(x)\|_- \ge 1, \quad \text{for all } x \notin [f = 0]. \tag{38}$$

Proof. The implication (ii) $\Rightarrow$ (i) is obvious. To prove (i) $\Rightarrow$ (ii) let us first establish that the function

$$r \in (0, +\infty) \mapsto u(r) = \frac{1}{\inf_{x \in [f = r]} \|\partial f(x)\|_-}$$

is finite-valued and nonincreasing. Let $0 < r_2 < r_1$ and let us show that $u(r_2) \ge u(r_1)$. To this end we may assume with no loss of generality that $u(r_1) > 0$ (and therefore that $[f = r_1] \cap \mathrm{dom}\,\partial f$ is nonempty). Take $\epsilon > 0$ and let $x_1 \in [f = r_1]$ and $p_1 \in \partial f(x_1)$ be such that $u(r_1) \le \frac{1}{\|p_1\|} + \epsilon$. Since the continuous function $t \mapsto f(\chi_{x_1}(t))$ tends to $\inf f = 0$ as $t$ goes to infinity (see [32] for instance), there exists $t_2 > 0$ such that $f(\chi_{x_1}(t_2)) = r_2$. From Lemma 26(iii), we obtain

$$\frac{1}{\|\partial f(\chi_{x_1}(t_2))\|_-} \ge \frac{1}{\|p_1\|},$$

which yields $u(r_2) \ge u(r_1)$. By (i) the function $u$ is finite-valued on $(0, r_0)$, thus, since $u$ is nonincreasing, it is also finite-valued on $(0, +\infty)$.

It is easy to see that [(i) $\Rightarrow$ (v)] of Theorem 18 holds without the compactness assumption (24) (see Remark 19). It follows that $u \in L^1(0, r_0)$ and by Lemma 44 (Annex) that there exists a decreasing continuous function $\bar u \in L^1(0, r_0)$ such that $\bar u \ge u$. Reproducing the proof of (v) $\Rightarrow$ (i) of Theorem 18 we obtain a strictly increasing, concave, $C^1$ function

$$\varphi(r) := \int_0^r \bar u(s)\,ds$$

for which (38) holds for all $x \in [0 < f \le r_0]$. Fix $\tilde r \in (0, r_0)$ and take $\varphi$ as above. Applying (38) and using the fact that $u(r)$ is decreasing we obtain

$$1 \le \varphi'(\tilde r)\,u(\tilde r)^{-1} \le \varphi'(\tilde r)\,u(r)^{-1} \le \varphi'(\tilde r)\,\|p\|,$$

for all $p \in \partial f(x)$, $x \in [f = r]$ and $r \in (\tilde r, +\infty)$ such that $u(r) > 0$. This shows that the function $\psi : \mathbb{R}_+ \to \mathbb{R}_+$ defined by

$$\psi(r) := \begin{cases} \varphi(r) & \text{if } r \le \tilde r, \\ \varphi(\tilde r) + \varphi'(\tilde r)(r - \tilde r) & \text{otherwise,} \end{cases}$$

satisfies the required properties. □

A natural question arises: when does a convex function $f$ satisfy the KL-inequality? In finite dimensions a quick positive answer can be given whenever $f$ belongs to an o-minimal structure (convexity then becomes superfluous). The following result gives an alternative criterion when $f$ is not extremely "flat" around its set of minimizers. More precisely, we assume the following growth condition:

There exist $m : [0, +\infty) \to [0, +\infty)$ and $S \subset H$ such that $m$ is continuous, increasing, $m(0) = 0$, $f \ge m(\mathrm{dist}(\cdot, C))$ on $S \cap \mathrm{dom}\,f$

$$\text{and} \quad \int_0^\rho \frac{m^{-1}(r)}{r}\,dr < +\infty \quad \text{(for some } \rho > 0\text{).} \tag{39}$$

Theorem 30 (growth assumptions and Kurdyka-Lojasiewicz inequality). Let $f : H \to \mathbb{R} \cup \{+\infty\}$ be a lower semicontinuous proper convex function satisfying (39) and let us assume $0 \in C := \mathrm{argmin}\,f$. Then the KL-inequality holds, i.e.

$$\|\partial(\varphi \circ f)(x)\|_- \ge 1, \quad \text{for all } x \in S \setminus \mathrm{argmin}\,f,$$

with

$$\varphi(r) = \int_0^r \frac{m^{-1}(s)}{s}\,ds.$$

Proof. Let $x \in S \cap \mathrm{dom}\,\partial f$ and let $a$ be the projection of $x$ onto the convex subset $C = \mathrm{argmin}\,f$. Using the convex inequality we have

$$f(x) - f(a) \le \langle \partial^0 f(x), x - a \rangle \le \mathrm{dist}(0, \partial f(x))\,\mathrm{dist}(x, C) \le \mathrm{dist}(0, \partial f(x))\,m^{-1}(f(x) - f(a)).$$

Using the chain rule (see Lemma 43) and the fact that $f(a) = 0$, we obtain $\mathrm{dist}(0, \partial(\varphi \circ f)(x)) \ge 1$ where $\varphi$ is as above (note that $\varphi \in \mathcal{K}(0, \rho)$). □

Remark 31. Assume that $H$ is finite-dimensional, and let $S$ be a compact convex subset of $H$ which satisfies $S \cap C \ne \emptyset$. Then there exists a convex continuous increasing function $m : \mathbb{R}_+ \to \mathbb{R}_+$ with $m(0) = 0$ such that $f(x) \ge m(\mathrm{dist}(x, C))$ for all $x \in S$.
[Sketch of the proof. With no loss of generality we assume that $0 \in S \cap C$. Using the Moreau-Yosida regularization (see [11] for instance), we obtain the existence of a finite-valued convex continuous function $g : H \to \mathbb{R}$ such that $f \ge g$ and $\mathrm{argmin}\,f = \mathrm{argmin}\,g$. Set $a = \max\{\mathrm{dist}(x, C) : x \in S\}$ and $m_0(s) = \min\{g(x) : x \in S,\ \mathrm{dist}(x, C) \ge s\} \in \mathbb{R}_+$ for all $s \in [0, a]$. Let $0 < s_1 < s_2 \le a$, and let $x_2 \in S$ be such that $\mathrm{dist}(x_2, C) \ge s_2$ and $0 < g(x_2) = m_0(s_2)$. Using the convexity of $g$ and the fact that $0 \in \mathrm{argmin}\,g \cap S$, we see that there exists $\lambda \in (0, 1)$ such that $g(\lambda x_2) < g(x_2)$, $\lambda x_2 \in S$ (recall that $S$ is convex and contains $0$), and $\mathrm{dist}(\lambda x_2, C) > s_1$. This shows that the function $m_0$ is finite-valued and increasing on $[0, a]$ and satisfies $m_0(\mathrm{dist}(x, C)) \le g(x) \le f(x)$ for any $x \in S$. Applying Lemma 45 (Annex) to $m_0$, we obtain a smooth increasing finite-valued function $m$ such that $0 < m(s) \le m_0(s)$ for $s \in (0, a]$ with $m(0) = 0$. The conclusion follows by extending $m$ to an increasing continuous function on $\mathbb{R}_+$.]

Example 32. Take $0 < \alpha < 1$. If $m(r) = \exp(-1/r^\alpha)$ and $m(0) = 0$, then for $0 < s \le \rho < 1$ we have $m^{-1}(s) = 1/(-\log s)^{1/\alpha}$ and

$$\int_0^\rho \frac{m^{-1}(s)}{s}\,ds < +\infty.$$

Therefore any convex function which is minorized by the function $x \mapsto \exp(-1/\mathrm{dist}(x, C)^\alpha)$ in some neighborhood of $C = \mathrm{argmin}\,f$ satisfies the KL-inequality.
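The integrability claim of Example 32 can be made explicit with the substitution $u = -\log s$, which turns the integral into $\int_{-\log\rho}^{+\infty} u^{-1/\alpha}\,du = \frac{(-\log\rho)^{1 - 1/\alpha}}{1/\alpha - 1}$, finite precisely because $1/\alpha > 1$. The following minimal sketch compares this closed form with a midpoint-rule approximation of the truncated integral; the function names and the truncation parameter `eps` are assumptions made for illustration.

```python
import math


def m(r, alpha):
    """Growth function m(r) = exp(-1 / r**alpha), with m(0) = 0."""
    return math.exp(-1.0 / r ** alpha) if r > 0 else 0.0


def m_inverse(s, alpha):
    """Inverse of m on (0, 1): m_inverse(s) = (-log s)**(-1/alpha)."""
    return (-math.log(s)) ** (-1.0 / alpha)


def phi_closed_form(rho, alpha):
    """Integral of m_inverse(s)/s over (0, rho), via u = -log(s); needs 0 < alpha < 1."""
    u0 = -math.log(rho)
    return u0 ** (1.0 - 1.0 / alpha) / (1.0 / alpha - 1.0)


def phi_truncated(rho, alpha, eps, n=200_000):
    """Midpoint rule in the variable u = -log(s) for the integral over (eps, rho)."""
    u0, u1 = -math.log(rho), -math.log(eps)
    h = (u1 - u0) / n
    return h * sum((u0 + (k + 0.5) * h) ** (-1.0 / alpha) for k in range(n))


if __name__ == "__main__":
    rho, alpha = math.exp(-1.0), 0.5
    print(phi_closed_form(rho, alpha), phi_truncated(rho, alpha, 1e-300))
```

For $\alpha \ge 1$ the same substitution gives a divergent integral, which is why the example restricts to $0 < \alpha < 1$.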



4.3 A smooth convex counterexample to the KL-inequality

In this section we construct a $C^2$ convex function on $\mathbb{R}^2$ with compact level sets that fails to satisfy the KL-inequality. This counterexample is constructed as follows:

- we first note that any sequence of sublevel sets of a convex function that satisfies the KL-inequality must comply with a specific property;

- we build a sequence of nested convex sets for which this property fails;

- we show that there exists a smooth convex function which admits this sequence as sublevel sets.
The last part relies on the use of support functions and on a result of Torralba [41]. For any closed convex subset $T$ of $\mathbb{R}^n$, we define its support function by $\sigma_T(x^*) = \sup_{x \in T} \langle x, x^* \rangle$ for all $x^* \in \mathbb{R}^n$. Let $f : \mathbb{R}^n \to \mathbb{R}$ be a convex function and $x^* \in \mathbb{R}^n$. Fenchel has observed, see [23], that the function $\lambda \mapsto \sigma_{[f \le \lambda]}(x^*)$ is concave and nondecreasing. The following result asserts that this fact provides somehow a sufficient condition to rebuild a convex function starting from a countable family of nested convex sets.

Theorem 33 (Convex functions with prescribed level sets [41]). Let $\{T_k\}$ be a nonincreasing sequence of convex compact subsets of $\mathbb{R}^n$ such that $\mathrm{int}\,T_k \supset T_{k+1}$ for all $k \ge 0$. For every $k > 0$ we set:

$$K_k = \max_{\|x^*\| = 1} \frac{\sigma_{T_{k-1}}(x^*) - \sigma_{T_k}(x^*)}{\sigma_{T_k}(x^*) - \sigma_{T_{k+1}}(x^*)} \in (0, +\infty).$$

Then for every strictly decreasing sequence $\{\lambda_k\}$, starting from $\lambda_0 > 0$ and satisfying

$$0 < K_k(\lambda_k - \lambda_{k+1}) \le \lambda_{k-1} - \lambda_k, \quad \text{for each } k > 0,$$
there exists a continuous convex function $f$ such that

$$T_k = [f \le \lambda_k], \quad \text{for every } k \in \mathbb{N},$$
and being maximal with this property.
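Remark 34.

(i) If $\{\lambda_k\}$ is as in the above theorem and $x^* \in \mathbb{R}^n \setminus \{0\}$, the condition on $K_k$ gives, for each $k > 0$,

$$\frac{\lambda_k - \lambda_{k+1}}{\sigma_{T_k}(x^*) - \sigma_{T_{k+1}}(x^*)} \le \frac{\lambda_{k-1} - \lambda_k}{\sigma_{T_{k-1}}(x^*) - \sigma_{T_k}(x^*)}.$$

Since the sum $\sum (\sigma_{T_k}(x^*) - \sigma_{T_{k+1}}(x^*))$ converges, so does the sum $\sum (\lambda_k - \lambda_{k+1})$, yielding the existence of the limit $\lim \lambda_k$. Since $f$ is the greatest function admitting $\{T_k\}$ as prescribed sublevel sets, we obtain $\min f = \lim \lambda_k$.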

(ii) Let $k \ge 0$ and $\lambda \in [\lambda_{k+1}, \lambda_k]$. The function $f$ satisfies further

$$[f \le \lambda] = \left(\frac{\lambda - \lambda_{k+1}}{\lambda_k - \lambda_{k+1}}\right) T_k + \left(\frac{\lambda_k - \lambda}{\lambda_k - \lambda_{k+1}}\right) T_{k+1}, \tag{40}$$

see [41, Remark 5.9].

The following lemma provides a decreasing sequence of convex compact subsets in $\mathbb{R}^2$ which cannot be a sequence of prescribed sublevel sets of a function satisfying the KL-inequality (see the conclusion part at the end of the proof of Theorem 36).



Lemma 35. There exists a decreasing sequence of compact subsets $\{T_k\}_k$ in $\mathbb{R}^2$ such that:

(i) $T_0$ is the unit disk $D := B(0, 1)$;
(ii) $T_{k+1} \subset \mathrm{int}\,T_k$ for every $k \in \mathbb{N}$;

(iii) $\bigcap_{k \in \mathbb{N}} T_k$ is the disk $D_r := B(0, r)$ for some $r > 0$;

(iv) $\sum_{k=0}^{+\infty} \mathrm{Dist}(T_k, T_{k+1}) = +\infty$.

Proof. We proceed by constructing the boundaries $\partial T_k$ of $T_k$ for each integer $k$. Let $C_{2,3}$ denote the circle of radius 1 and let us define recursively a sequence of closed convex curves $C_{n,m}$ for $n \ge 3$ and $1 \le m \le n+1$; we assume that $C_{n-1,n}$ is the circle of radius $R_n > 0$. Let $\{\mu_n\}$ be a sequence in $(0, 1)$ that will be chosen later in order to satisfy (iii). Then, for $1 \le m \le n$, let us define $C_{n,m}$ to be the union of the segments:

- $[\mu_n^m R_n \exp(2i\pi \frac{j}{n}),\ \mu_n^m R_n \exp(2i\pi \frac{j+1}{n})]$ for $0 \le j \le m - 1$ (here $i$ stands for the imaginary unit)

and the circle-arc:

- $\mu_n^m R_n \exp(i\theta)$ for $2\pi \frac{m}{n} \le \theta \le 2\pi$.

In other words, $C_{n,m}$ consists of the first $m$ edges of a regular convex $n$-gon inscribed in a circle of radius $\mu_n^m R_n$ and a circle-arc of the same radius to close the curve. We then set

$$R_{n+1} = \mu_n^{n+1} R_n \cos\left(\frac{\pi}{n}\right)$$

and define $C_{n,n+1}$ to be the circle of radius $R_{n+1}$. Figure 1 illustrates the curves $C_{5,6}$ and $C_{6,m}$ for $m = 1, \dots, 6$.

Figure 1: The curves $C_{5,6}$, $C_{6,1}$ to $C_{6,6}$.



Ordering {(re, m) : re > 3, 1 < m < re + 1} lexicographically we define succesively the 
convex subset T/- to be the convex envelope of the set C n ^ m . By construction (i) and (ii) are 
satisfied. Item (Hi) holds if limi? n > which is equivalent to the fact that the infinite product 
n+r 3 Ain +1 cos(vr/n) does not converge to 0. This can be achieved by taking \i n = 1 — 1/n 3 . Let 
r > be the limit of {Rn}- The intersection of the convex sets T n is the disk of radius r. 

Take $n \ge 3$. Considering the middle of the segment $[\mu_n R_n,\ \mu_n R_n \exp(2i\pi/n)]$ in $C_{n,1}$ and
the point $R_n \exp(i\pi/n) \in C_{n-1,n}$, we obtain $\operatorname{Dist}(C_{n,1}, C_{n-1,n}) = R_n(1 - \mu_n \cos(\pi/n))$. If $2 \le m \le n$,
considering the middle of the segment

$$\Big[\mu_n^m R_n \exp\Big(\frac{2i\pi(m-1)}{n}\Big),\ \mu_n^m R_n \exp\Big(\frac{2i\pi m}{n}\Big)\Big]$$

in $C_{n,m}$ and the point $\mu_n^{m-1} R_n \exp\big(\frac{i\pi(2m-1)}{n}\big) \in C_{n,m-1}$, we get

$$\operatorname{Dist}(C_{n,m}, C_{n,m-1}) = \mu_n^{m-1} R_n (1 - \mu_n \cos(\pi/n)).$$

Finally, considering the points $\mu_n^n R_n \in C_{n,n}$ and $\mu_n^n \cos(\pi/n) R_n$, we obtain

$$\operatorname{Dist}(C_{n,n}, C_{n,n+1}) = \mu_n^n R_n (1 - \cos(\pi/n)).$$

Thus

$$\operatorname{Dist}(C_{n,1}, C_{n-1,n}) + \sum_{m=2}^{n} \operatorname{Dist}(C_{n,m}, C_{n,m-1}) = \sum_{m=1}^{n} \mu_n^{m-1} R_n \Big(1 - \mu_n \cos\frac{\pi}{n}\Big) \sim n r \frac{\pi^2}{2n^2} = \frac{\pi^2 r}{2n}.$$

Since the series $\sum_n 1/n$ diverges, (iv) holds. □
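
The two competing requirements of the construction (the radii $R_n$ must stay away from zero while the accumulated distances must diverge) can be explored numerically. The snippet below is a minimal sketch under the choice $\mu_n = 1 - 1/n^3$ made in the proof; the function name `lemma35_radii_and_gaps` is hypothetical, and the accumulated quantity is only the lower bound $\sum_{m=1}^{n} \mu_n^{m-1} R_n (1 - \mu_n\cos(\pi/n))$ for each $n$, not the full series.

```python
import math


def lemma35_radii_and_gaps(n_max):
    """Iterate R_{n+1} = mu_n^(n+1) * R_n * cos(pi/n) with mu_n = 1 - 1/n^3.

    Returns (R_{n_max+1}, total), where total accumulates, for n = 3..n_max,
    the block sums  sum_{m=1}^{n} mu_n^(m-1) * R_n * (1 - mu_n*cos(pi/n)).
    """
    radius = 1.0      # R_3 = 1: the construction starts from the unit circle
    total_gap = 0.0
    for n in range(3, n_max + 1):
        mu = 1.0 - 1.0 / n**3
        c = math.cos(math.pi / n)
        step = 1.0 - mu * c
        # distances between consecutive curves C_{n,m-1} and C_{n,m}, m = 1..n
        total_gap += sum(mu ** (m - 1) * radius * step for m in range(1, n + 1))
        radius = mu ** (n + 1) * radius * c
    return radius, total_gap


if __name__ == "__main__":
    for n_max in (100, 200, 400, 800):
        r, g = lemma35_radii_and_gaps(n_max)
        print(n_max, r, g)
```

In such a run the radius should settle at a positive value, while the accumulated gap keeps growing by a roughly constant amount each time `n_max` doubles, which is the logarithmic divergence predicted by the equivalent $\pi^2 r/(2n)$.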

For $\theta \in \mathbb{R}/2\pi\mathbb{Z}$, set $n(\theta) = (\cos\theta, \sin\theta)$ and $\tau(\theta) = (-\sin\theta, \cos\theta)$. We say that a closed
$C^2$ curve $C$ in $\mathbb{R}^2$ is convex if its curvature has constant sign. If moreover the curvature never
vanishes, then there exists a $C^1$ parametrization $c : \mathbb{R}/2\pi\mathbb{Z} \to C$ of $C$, called parametrization
of $C$ by its normal, such that the unit tangent vector at $c(\theta)$ is $\tau(\theta)$. In this case $n(\theta)$ is the
outward normal to the convex envelope of $C$ at $c(\theta)$. Moreover, $c$ is $C^\infty$ whenever $C$ is so. In
this case, we denote by $\rho_c(\theta)$ the curvature radius of $c$ at $c(\theta)$ and we have

$$\dot c(\theta) = \rho_c(\theta)\,\tau(\theta).$$

Let us denote by $T$ the convex envelope of $C$. Using the fact that $n$ defines the outward
normals to $T$, we get

$$\langle c(\theta), n(\theta)\rangle = \max_{x \in T}\langle x, n(\theta)\rangle = \sigma_T(n(\theta)), \quad \forall \theta \in \mathbb{R}/2\pi\mathbb{Z}.$$

Theorem 36 (convex counterexample). There exists a $C^2$ convex function $f : \mathbb{R}^2 \to \mathbb{R}$ such
that $\min f = 0$ which does not satisfy the KL-inequality and whose set of minimizers is compact
with nonempty interior. More precisely, for each $r > 0$ and for each desingularization function
$\varphi \in \mathcal{K}(0, r)$ we have

$$\inf\{\|\nabla(\varphi \circ f)(x)\| : x \in [0 < f < r]\} = 0.$$

Remark 37.

(i) It can be seen from the forthcoming proof that $\operatorname{argmin} f$ is the closed disk centered at 0 of
radius $r$, and that $f$ is actually $C^\infty$ on the complement of the circle of radius $r$.

(ii) The fact that $f$ is $C^2$ shows that the KL-inequality is not related to the smoothness of $f$. Besides,
it seems clear from the proof that a $C^k$ ($k$ arbitrary) counterexample could be obtained.

(iii) Since $\operatorname{argmin} f$ has nonempty interior, Theorem 27 shows that the lengths of subgradient
curves are uniformly bounded. Using the notation and the results of Theorem 20, we see that
the function $f$ shows that the uniform boundedness of the lengths of the subgradient curves
(starting from a given level set $[f = r_0]$) does not yield the uniform boundedness of the lengths
of the piecewise subgradient curves $\gamma$ lying in $[\min f < f < r_0]$.



Proof of Theorem 36. Let $M$, $N$ be topological finite-dimensional manifolds. In this proof,
a mapping $F : M \to N$ is said to be proper if for each compact subset $K$ of $N$, $F^{-1}(K)$ is a
compact subset of $M$.

Smoothing the sequence $T_k$. Let us consider a sequence of convex compact sets $\{T_k\}$ as in
Lemma 35. Set $C_k = \partial T_k$ and consider a positive sequence $\varepsilon_k$ such that $\sum \varepsilon_k < +\infty$ with
$\varepsilon_k + \varepsilon_{k+1} < \operatorname{Dist}(T_k, T_{k+1}) = \operatorname{Dist}(C_k, C_{k+1})$ for each integer $k$. The $\varepsilon_k$-neighborhood of $C_k$ can
be seen to be disjoint from the $\varepsilon_{k'}$-neighborhood of $C_{k'}$ whenever $k \ne k'$. We can deform $C_k$ into
a $C^\infty$ convex closed curve $\tilde C_k$ whose curvature never vanishes, lying in the $\varepsilon_k$-neighborhood of $C_k$.
This smooth deformation can be achieved by letting $C_k$ evolve under the mean-curvature flow
during a very short time, see [22] for the smoothing aspects and [25, 03] for the positive curvature
results. We set $\tilde T_k$ to be the closed convex envelope of $\tilde C_k$. This process yields a decreasing
sequence of compact convex sets $\{\tilde T_k\}$ that satisfies the conditions of Lemma 35. We note
that the circle of radius 1 has non-zero curvature and we set $\tilde C_0 = C_0$. Since $\operatorname{Dist}(\tilde T_k, \tilde T_{k+1}) \ge
\operatorname{Dist}(T_k, T_{k+1}) - (\varepsilon_k + \varepsilon_{k+1})$ and $\sum \varepsilon_k < +\infty$, condition (iv) holds. With no loss of generality
we may therefore assume that for each $k \ge 0$ the curve $\partial T_k$ is smooth and can be parametrized
by its normal.

Let $K_k$ be as in Theorem 33, let $\lambda_0$ and $\lambda_1$ be such that $\lambda_0 > \lambda_1$. We define $\lambda_k$ recursively by

$$K_k(\lambda_k - \lambda_{k+1}) = \tfrac{1}{2}(\lambda_{k-1} - \lambda_k). \tag{41}$$

Because of (41), Theorem 33 yields a continuous convex function $f : T_0 \to \mathbb{R}$ such that $T_k =
[f \le \lambda_k]$. Since $f$ is the greatest function with this property, we deduce that $\min f = \lim \lambda_k$ and
$\operatorname{argmin} f = \bigcap_{k \in \mathbb{N}} T_k$.

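Smoothing the function $f$ on $\mathbb{R}^n \setminus \operatorname{argmin} f$. We can easily extend $f$ outside $T_0$ into a smooth
convex function. Let us examine the restriction of $f$ to $T_0$. Since $\partial T_k$ can be parametrized by
its normal, we denote by $c_k : \mathbb{R}/2\pi\mathbb{Z} \to \mathbb{R}^2$ this parametrization. Let us fix $k \in \mathbb{N}$ and $\lambda \in [\lambda_{k+1}, \lambda_k]$. Let $\theta$ be in
$\mathbb{R}/2\pi\mathbb{Z}$. Using Remark 34 (ii), we obtain

$$\begin{aligned}
\max_{x \in [f \le \lambda]} \langle x, n(\theta)\rangle
&= \Big(\frac{\lambda - \lambda_{k+1}}{\lambda_k - \lambda_{k+1}}\Big) \max_{x \in T_k}\langle x, n(\theta)\rangle + \Big(\frac{\lambda_k - \lambda}{\lambda_k - \lambda_{k+1}}\Big) \max_{x \in T_{k+1}}\langle x, n(\theta)\rangle \\
&= \Big(\frac{\lambda - \lambda_{k+1}}{\lambda_k - \lambda_{k+1}}\Big) \langle c_k(\theta), n(\theta)\rangle + \Big(\frac{\lambda_k - \lambda}{\lambda_k - \lambda_{k+1}}\Big) \langle c_{k+1}(\theta), n(\theta)\rangle \\
&= \Big\langle \Big(\frac{\lambda - \lambda_{k+1}}{\lambda_k - \lambda_{k+1}}\Big) c_k(\theta) + \Big(\frac{\lambda_k - \lambda}{\lambda_k - \lambda_{k+1}}\Big) c_{k+1}(\theta),\ n(\theta)\Big\rangle.
\end{aligned} \tag{42}$$

Using (40) once more we obtain

$$\Big(\frac{\lambda - \lambda_{k+1}}{\lambda_k - \lambda_{k+1}}\Big) c_k(\theta) + \Big(\frac{\lambda_k - \lambda}{\lambda_k - \lambda_{k+1}}\Big) c_{k+1}(\theta) \in [f \le \lambda].$$

Since the above maximum is achieved in $[f = \lambda]$, it follows that

$$f\Big(\Big(\frac{\lambda - \lambda_{k+1}}{\lambda_k - \lambda_{k+1}}\Big) c_k(\theta) + \Big(\frac{\lambda_k - \lambda}{\lambda_k - \lambda_{k+1}}\Big) c_{k+1}(\theta)\Big) = \lambda. \tag{43}$$

Let us define $G : \mathbb{R} \times \mathbb{R}/2\pi\mathbb{Z} \to \mathbb{R}^2$ by

$$G(\lambda, \theta) = \Big(\frac{\lambda - \lambda_{k+1}}{\lambda_k - \lambda_{k+1}}\Big) c_k(\theta) + \Big(\frac{\lambda_k - \lambda}{\lambda_k - \lambda_{k+1}}\Big) c_{k+1}(\theta).$$

The map $G$ is clearly $C^\infty$. Since $\frac{\partial G}{\partial \lambda} = \frac{c_k(\theta) - c_{k+1}(\theta)}{\lambda_k - \lambda_{k+1}}$, we have

$$\Big\langle \frac{\partial G}{\partial \lambda}, n(\theta)\Big\rangle = \frac{\langle c_k(\theta), n(\theta)\rangle - \langle c_{k+1}(\theta), n(\theta)\rangle}{\lambda_k - \lambda_{k+1}} = \frac{\max_{x \in T_k}\langle x, n(\theta)\rangle - \max_{x \in T_{k+1}}\langle x, n(\theta)\rangle}{\lambda_k - \lambda_{k+1}} > 0.$$

On the other hand

$$\frac{\partial G}{\partial \theta} = \Big(\Big(\frac{\lambda - \lambda_{k+1}}{\lambda_k - \lambda_{k+1}}\Big) \rho_{c_k}(\theta) + \Big(\frac{\lambda_k - \lambda}{\lambda_k - \lambda_{k+1}}\Big) \rho_{c_{k+1}}(\theta)\Big)\,\tau(\theta). \tag{44}$$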



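Since $\rho_{c_k} > 0$ and $\rho_{c_{k+1}} > 0$, $G$ is a local diffeomorphism on $(\lambda_{k+1} - \delta, \lambda_k + \delta) \times \mathbb{R}/2\pi\mathbb{Z}$
for any $\delta > 0$ sufficiently small. In view of (42), we have $G(\lambda, \theta) \in [\lambda_{k+1} < f < \lambda_k]$ for
$\lambda_{k+1} < \lambda < \lambda_k$ and $G(\lambda, \theta) \in [\lambda_{k+1} \le f \le \lambda_k]$ for $\lambda_{k+1} \le \lambda \le \lambda_k$. Since the map $\bar G :
[\lambda_{k+1}, \lambda_k] \times \mathbb{R}/2\pi\mathbb{Z} \to [\lambda_{k+1} \le f \le \lambda_k]$ defined by $\bar G(\lambda, \theta) = G(\lambda, \theta)$ is proper, $\bar G$ is a covering
map from $[\lambda_{k+1}, \lambda_k] \times \mathbb{R}/2\pi\mathbb{Z}$ to $[\lambda_{k+1} \le f \le \lambda_k]$. The set $[\lambda_{k+1} \le f \le \lambda_k]$ is connected, thus $\bar G$
is onto. Using (42) and $G(\lambda_k, \theta) = c_k(\theta)$, one sees that $(\lambda_k, \theta)$ is the only antecedent of $c_k(\theta)$ by
$\bar G$ and, since $[\lambda_{k+1}, \lambda_k] \times \mathbb{R}/2\pi\mathbb{Z}$ is connected, $\bar G$ is injective. Thus $\bar G$ is a $C^\infty$ diffeomorphism
(see [HI, Proposition 2.19]). By (42), this implies that the restriction of $f$ to $[\lambda_{k+1} < f < \lambda_k]$
is $C^\infty$. Using (42), we know that the level line $[f = \lambda]$ (for $\lambda_{k+1} \le \lambda \le \lambda_k$) is parametrized
by $G(\lambda, \theta)$ for $\theta \in \mathbb{R}/2\pi\mathbb{Z}$; if $c_\lambda$ denotes this parametrization, then $c_k = c_{\lambda_k}$. Besides, by (44),
$c_\lambda$ is a parametrization by the normal and $\rho_{c_\lambda}$ is a convex combination of $\rho_{c_k}$ and $\rho_{c_{k+1}}$, hence
$\rho_{c_\lambda} > 0$.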

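Let us compute $\nabla f$ at $c_\lambda(\theta)$. Equation (42) yields $1 = \big\langle \nabla f(G(\lambda, \theta)), \frac{\partial G}{\partial \lambda}(\lambda, \theta)\big\rangle$. Besides we
also know that the normal to $[f = \lambda]$ at $c_\lambda(\theta)$ is $n(\theta)$. Since the gradient $\nabla f(G(\lambda, \theta))$ and the
normal $n(\theta)$ are linearly dependent, we obtain

$$\nabla f(c_\lambda(\theta)) = \frac{\lambda_k - \lambda_{k+1}}{\langle c_{\lambda_k}(\theta) - c_{\lambda_{k+1}}(\theta), n(\theta)\rangle}\, n(\theta). \tag{45}$$

Note that this expression does not depend on $\lambda \in [\lambda_{k+1}, \lambda_k]$.
Before going further let us observe/recall two facts.

- First, using the aforementioned result of Fenchel [23], we deduce from the convexity of $f$
that the function

$$\lambda \mapsto \langle c_\lambda(\theta), n(\theta)\rangle = \sigma_{[f \le \lambda]}(n(\theta)) \text{ is concave and increasing.} \tag{46}$$

- Let $\lambda$ and $\lambda'$ be such that $\lambda_{k+1} \le \lambda < \lambda' \le \lambda_k$. We have: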



(Smoothing $f$ around $[f = \lambda_k]$.) We have seen that the function $f$ is $C^\infty$ on the complement
of the union of the level lines $[f = \lambda_k]$ for $k \in \mathbb{N}$. In order to go further we need to modify $f$
around each $[f = \lambda_k]$.

Consider a positive sequence $\{\varepsilon_k\}$ such that $\sum_i \varepsilon_i < +\infty$ and $\varepsilon_k + \varepsilon_{k+1} < \operatorname{Dist}(T_k, T_{k+1}) =
\operatorname{Dist}([f = \lambda_k], [f = \lambda_{k+1}])$ for each integer $k$. Let us assume that there exists a sequence
$f_k : \mathbb{R}^2 \to \mathbb{R}$ of convex functions such that:

P1 $f_0 = f$;

P2 $f_k = f_{k-1}$ outside an $\varepsilon_k$-neighborhood of $[f = \lambda_k]$;

P3 $f_k$ is $C^\infty$ in $[f > \lambda_{k+1}]$;

P4 $\|\nabla f_k\|$ is bounded in $[f \le \lambda_k]$ by the maximum of $\|\nabla f\|$ in $[\lambda_k \le f \le \lambda_{k-1}]$.

Let us choose $k \ge 1$ and $\lambda$, $\lambda'$ such that $\lambda_{k+1} < \lambda < \lambda_k < \lambda' < \lambda_{k-1}$. Then by (41) and (45)
we have:

$$\|\nabla f(c_\lambda(\theta))\| = \frac{\lambda_k - \lambda_{k+1}}{\langle c_{\lambda_k}(\theta) - c_{\lambda_{k+1}}(\theta), n(\theta)\rangle} \le \frac{\lambda_{k-1} - \lambda_k}{2\,\langle c_{\lambda_{k-1}}(\theta) - c_{\lambda_k}(\theta), n(\theta)\rangle} = \frac{1}{2}\|\nabla f(c_{\lambda'}(\theta))\|.$$

Hence

$$\max_{[\lambda_{k+1} \le f \le \lambda_k]} \|\nabla f\| \le \frac{1}{2} \max_{[\lambda_k \le f \le \lambda_{k-1}]} \|\nabla f\|. \tag{49}$$

Combining with (P4), the above implies that the sequence $(f_k)_{k \in \mathbb{N}}$ is uniformly Lipschitz
continuous. Applying the Ascoli compactness theorem we obtain that $f_k$ converges to a continuous
function $\tilde f$ which is convex. From (P2) and (P3), we obtain successively that $\tilde f$ has the same set
of minimizers as $f$, $\tilde f$ is $C^\infty$ outside $\operatorname{argmin} f$, and $[\tilde f = \lambda_k]$ is in the $\varepsilon_k$-neighborhood of $[f = \lambda_k]$.
Moreover (49) and (P4) imply that $\|\nabla \tilde f(x)\|$ goes to zero as $x$ approaches $\operatorname{argmin} f$, hence $\tilde f$ is
globally $C^1$. Note also that the sequence of level sets $[\tilde f \le \lambda_k]$ satisfies the hypothesis (iv) of
Lemma 35. As shown in the conclusion, $\tilde f$ provides a $C^1$ counterexample to the KL-inequality.

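Let us define such a sequence $\{f_k\}$ by induction. Assume that $f_{k-1}$ is defined. In order to
construct $f_k$, it suffices to proceed in the $\varepsilon_k$-neighborhood of $[f = \lambda_k]$. Let $\varepsilon > 0$ be such that
$[\lambda_k - 2\varepsilon \le f \le \lambda_k + 2\varepsilon]$ is in the $\varepsilon_k$-neighborhood of $[f = \lambda_k]$. Let us consider a $C^\infty$ function
$\mu_- : [-2\varepsilon, 2\varepsilon] \to \mathbb{R}$ which satisfies the following properties:

1. $\mu_-$ is nonincreasing,
2. $\mu_-'' \ge 0$,
3. $\mu_-(\lambda) = -\lambda/\varepsilon$ on $[-2\varepsilon, -\varepsilon/2]$,
4. $\mu_-(\lambda) = 0$ on $[\varepsilon/2, 2\varepsilon]$.

Let us then define $\mu_+(\lambda) := \lambda/\varepsilon + \mu_-(\lambda)$ and $\mu_0 := 1 - \mu_- - \mu_+$. The function $\mu_+$ satisfies

1'. $\mu_+$ is nondecreasing,
2'. $\mu_+'' = \mu_-'' \ge 0$,
3'. $\mu_+(\lambda) = 0$ on $[-2\varepsilon, -\varepsilon/2]$,
4'. $\mu_+(\lambda) = \lambda/\varepsilon$ on $[\varepsilon/2, 2\varepsilon]$.

Set $c_- = c_{\lambda_k - \varepsilon}$, $c_0 = c_{\lambda_k}$, $c_+ = c_{\lambda_k + \varepsilon}$ and

$$\begin{aligned}
M_-(\theta) &= \langle c_-(\theta), n(\theta)\rangle = \max_{x \in [f \le \lambda_k - \varepsilon]} \langle x, n(\theta)\rangle, \\
M_0(\theta) &= \langle c_0(\theta), n(\theta)\rangle = \max_{x \in [f \le \lambda_k]} \langle x, n(\theta)\rangle, \\
M_+(\theta) &= \langle c_+(\theta), n(\theta)\rangle = \max_{x \in [f \le \lambda_k + \varepsilon]} \langle x, n(\theta)\rangle.
\end{aligned}$$

For $(\lambda, \theta) \in [-2\varepsilon, 2\varepsilon] \times \mathbb{R}/2\pi\mathbb{Z}$, we define:

$$H(\lambda, \theta) = \mu_-(\lambda) c_-(\theta) + \mu_0(\lambda) c_0(\theta) + \mu_+(\lambda) c_+(\theta).$$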



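Then $H$ is a $C^\infty$ map and for any $\lambda \in [-\varepsilon, \varepsilon]$, we have $\mu_-(\lambda)$, $\mu_0(\lambda)$ and $\mu_+(\lambda)$ in $[0, 1]$. Since
$H(\lambda, \theta)$ is a convex combination of points in $[f \le \lambda_k + \varepsilon]$, we deduce $H(\lambda, \theta) \in [f \le \lambda_k + \varepsilon]$ and
$H(\lambda, \theta) \in [f < \lambda_k + \varepsilon]$ whenever $\lambda < \varepsilon$ and $\mu_+(\lambda) < 1$. Since

$$\langle H(\lambda, \theta), n(\theta)\rangle = \mu_-(\lambda) M_-(\theta) + \mu_0(\lambda) M_0(\theta) + \mu_+(\lambda) M_+(\theta) \ge M_-(\theta),$$

we get $H(\lambda, \theta) \in [f \ge \lambda_k - \varepsilon]$, and $H(\lambda, \theta) \in [f > \lambda_k - \varepsilon]$ whenever $\lambda > -\varepsilon$, $\mu_-(\lambda) < 1$. It follows
that

$$\frac{\partial H}{\partial \lambda} = \mu_-'(\lambda) c_-(\theta) + \mu_0'(\lambda) c_0(\theta) + \mu_+'(\lambda) c_+(\theta).$$

Since $\mu_0' = -\mu_-' - \mu_+'$, items 1 and 1' entail

$$\begin{aligned}
\Big\langle \frac{\partial H}{\partial \lambda}, n(\theta)\Big\rangle &= \mu_+'(\lambda)\langle c_+(\theta) - c_0(\theta), n(\theta)\rangle - \mu_-'(\lambda)\langle c_0(\theta) - c_-(\theta), n(\theta)\rangle \\
&= \mu_+'(\lambda)(M_+(\theta) - M_0(\theta)) - \mu_-'(\lambda)(M_0(\theta) - M_-(\theta)) \\
&> 0.
\end{aligned}$$

On the other hand

$$\frac{\partial H}{\partial \theta} = \big(\mu_-(\lambda)\rho_{c_-}(\theta) + \mu_0(\lambda)\rho_{c_0}(\theta) + \mu_+(\lambda)\rho_{c_+}(\theta)\big)\,\tau(\theta), \tag{50}$$

so that $\langle \frac{\partial H}{\partial \theta}, n(\theta)\rangle = 0$ and $\langle \frac{\partial H}{\partial \theta}, \tau(\theta)\rangle > 0$ for $\lambda \in\ ]-\varepsilon', \varepsilon'[$ with $\varepsilon' > \varepsilon$. Thus $H$ is a local
diffeomorphism on $]-\varepsilon', \varepsilon'[\ \times\ \mathbb{R}/2\pi\mathbb{Z}$. The map $\bar H : [-\varepsilon, \varepsilon] \times \mathbb{R}/2\pi\mathbb{Z} \to [\lambda_k - \varepsilon \le f \le \lambda_k + \varepsilon]$
defined by $\bar H(\lambda, \theta) = H(\lambda, \theta)$ is proper, therefore $\bar H$ is a covering map from $[-\varepsilon, \varepsilon] \times \mathbb{R}/2\pi\mathbb{Z}$ to
$[\lambda_k - \varepsilon \le f \le \lambda_k + \varepsilon]$. Since $[\lambda_k - \varepsilon \le f \le \lambda_k + \varepsilon]$ is connected, $\bar H$ is onto. Besides, since
$c_+(\theta) \in [f = \lambda_k + \varepsilon]$, $(\varepsilon, \theta)$ is the only antecedent of $c_+(\theta)$ by $\bar H$, so $\bar H$ is injective by connectedness
of $[-\varepsilon, \varepsilon] \times \mathbb{R}/2\pi\mathbb{Z}$. $\bar H$ is therefore a $C^\infty$ diffeomorphism from $[-\varepsilon, \varepsilon] \times \mathbb{R}/2\pi\mathbb{Z}$ into $[\lambda_k - \varepsilon \le f \le
\lambda_k + \varepsilon]$.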

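We then define $f_k$ to be $f_{k-1}$ outside of $[\lambda_k - \varepsilon \le f \le \lambda_k + \varepsilon]$ and by $f_k(H(\lambda, \theta)) = \lambda_k + \lambda$
in $[\lambda_k - \varepsilon \le f \le \lambda_k + \varepsilon]$. When $\lambda \in [\lambda_k - \varepsilon, \lambda_k - \varepsilon/2]$, Properties 3, 3' and equation (41) yield

$$\begin{aligned}
H(\lambda - \lambda_k, \theta) &= -\frac{\lambda - \lambda_k}{\varepsilon}\, c_-(\theta) + \Big(1 + \frac{\lambda - \lambda_k}{\varepsilon}\Big) c_0(\theta) \\
&= \frac{\lambda_k - \lambda}{\lambda_k - (\lambda_k - \varepsilon)}\, c_{\lambda_k - \varepsilon}(\theta) + \frac{\lambda - (\lambda_k - \varepsilon)}{\lambda_k - (\lambda_k - \varepsilon)}\, c_{\lambda_k}(\theta) \\
&= c_\lambda(\theta).
\end{aligned}$$

Thus $f_k = f = f_{k-1}$ in $[\lambda_k - \varepsilon \le f \le \lambda_k - \varepsilon/2]$ and for similar reasons $f_k = f_{k-1}$ in $[\lambda_k + \varepsilon/2 \le
f \le \lambda_k + \varepsilon]$. The "gluing" is therefore $C^\infty$ along $[f = \lambda_k - \varepsilon]$ and $[f = \lambda_k + \varepsilon]$.
Hence, $f_k$ satisfies (P3).

Let us compute $\nabla f_k$ in $[\lambda_k - \varepsilon \le f \le \lambda_k + \varepsilon]$. By definition of $f_k$, $1 = \big\langle \nabla f_k(H(\lambda, \theta)), \frac{\partial H}{\partial \lambda}\big\rangle$.
Besides $H(\lambda - \lambda_k, \theta)$ is a parametrization of the level line $[f_k = \lambda]$ by its normal (see (50)), hence
$\nabla f_k(H(\lambda, \theta)) = a\, n(\theta)$ with $a > 0$. Using both formulae, we finally get

$$\nabla f_k(H(\lambda, \theta)) = \frac{n(\theta)}{\mu_+'(\lambda)\langle c_+(\theta) - c_0(\theta), n(\theta)\rangle - \mu_-'(\lambda)\langle c_0(\theta) - c_-(\theta), n(\theta)\rangle}.$$

From the definition of $\mu_+$, $\mu_+'(\lambda) - \mu_-'(\lambda) = 1/\varepsilon$. Besides, for $\lambda \in [-\varepsilon, -\varepsilon/2]$ we have (by Properties 3 and 3')

$$\nabla f_k(H(\lambda, \theta)) = \frac{\varepsilon\, n(\theta)}{\langle c_0(\theta) - c_-(\theta), n(\theta)\rangle},$$

while for $\lambda \in [\varepsilon/2, \varepsilon]$ we get (by Properties 4 and 4')

$$\nabla f_k(H(\lambda, \theta)) = \frac{\varepsilon\, n(\theta)}{\langle c_+(\theta) - c_0(\theta), n(\theta)\rangle}.$$

Hence by (45):

$$\|\nabla f_k(H(\lambda, \theta))\| \le \|\nabla f(c_{\lambda_k + \varepsilon}(\theta))\|.$$

(P4) is therefore satisfied.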

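The last assertion we need to establish is the convexity of $f_k$. By construction, it suffices
to prove that the Hessian $Q_{f_k}$ of $f_k$ is nonnegative in $[\lambda_k - \varepsilon < f < \lambda_k + \varepsilon]$. Let us denote
by $Q_H$ the Hessian of $H$ (observe that $Q_H$ takes its values in $\mathbb{R}^2$). For $-\varepsilon < \lambda < \varepsilon$, we have
$\lambda + \lambda_k = f_k(H(\lambda, \theta))$, thus

$$0 = Q_{f_k}(H(\lambda, \theta))\big(DH(\lambda, \theta)(\cdot), DH(\lambda, \theta)(\cdot)\big) + \langle \nabla f_k(H(\lambda, \theta)), Q_H(\lambda, \theta)(\cdot, \cdot)\rangle$$

where $DH$ denotes the differential map of $H$. To prove that $Q_{f_k}$ is nonnegative, it suffices to
prove that $\langle \nabla f_k(H(\lambda, \theta)), Q_H(\lambda, \theta)(\cdot, \cdot)\rangle \le 0$. We have

$$\begin{aligned}
\frac{\partial^2 H}{\partial \lambda^2} &= \mu_-''(\lambda) c_-(\theta) + \mu_0''(\lambda) c_0(\theta) + \mu_+''(\lambda) c_+(\theta) \\
&= \mu_-''(\lambda)(c_-(\theta) - c_0(\theta)) + \mu_+''(\lambda)(c_+(\theta) - c_0(\theta)) \\
&= \mu_+''(\lambda)\big((c_+(\theta) - c_0(\theta)) - (c_0(\theta) - c_-(\theta))\big),
\end{aligned}$$

where the last equality is due to item 2'. On the other hand

$$\Big\langle \nabla f_k(H(\lambda, \theta)), \frac{\partial^2 H}{\partial \lambda^2}\Big\rangle = \mu_+''(\lambda)\,\|\nabla f_k(H(\lambda, \theta))\|\,\big(\langle c_+(\theta) - c_0(\theta), n(\theta)\rangle - \langle c_0(\theta) - c_-(\theta), n(\theta)\rangle\big)$$

which is nonpositive because of (46). Besides we have

$$\frac{\partial^2 H}{\partial \lambda\, \partial \theta} = \big(\mu_-'(\lambda)\rho_{c_-}(\theta) + \mu_0'(\lambda)\rho_{c_0}(\theta) + \mu_+'(\lambda)\rho_{c_+}(\theta)\big)\,\tau(\theta),$$

thus $\big\langle \nabla f_k(H(\lambda, \theta)), \frac{\partial^2 H}{\partial \lambda\, \partial \theta}\big\rangle = 0$. Finally

$$\frac{\partial^2 H}{\partial \theta^2} = \big(\mu_-(\lambda)\rho_{c_-}(\theta) + \mu_0(\lambda)\rho_{c_0}(\theta) + \mu_+(\lambda)\rho_{c_+}(\theta)\big)(-n(\theta)) + (\cdots)\,\tau(\theta),$$

hence the quantity

$$\Big\langle \nabla f_k(H(\lambda, \theta)), \frac{\partial^2 H}{\partial \theta^2}\Big\rangle$$

is negative since all the $\mu$ and $\rho$ are nonnegative. Hence $Q_{f_k}$ is nonnegative and the function $f_k$
is convex.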

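$C^2$ smoothing. For $\lambda \in (\min \tilde f, \lambda_0]$, define

$$h(\lambda) = (\lambda - \min \tilde f)\Big(1 + \max_{[\lambda \le \tilde f \le \lambda_0]} \|Q_{\tilde f}\|\Big)^{-1}.$$

Since $\tilde f$ is $C^\infty$ in $[\min \tilde f < \tilde f]$, $h$ is a continuous, positive, increasing function. Then there exists
$\psi \in C^\infty(\mathbb{R}, \mathbb{R}_+)$ which vanishes on $(-\infty, \min \tilde f]$, increases on $(0, +\infty)$ and for $\lambda \in (\min \tilde f, \lambda_0]$,
$0 < \psi(\lambda) \le h(\lambda)$ (see Lemma 45). Let $g$ be the primitive of $\psi$ with $g(\min \tilde f) = 0$. The function
$g$ is a strictly increasing convex $C^\infty$-function on $[\min \tilde f, +\infty)$. The function $\hat f = g \circ \tilde f$ is therefore
a $C^1$ convex function. Moreover $\hat f$ is $C^\infty$ at each point outside the boundary of $\operatorname{argmin} \tilde f$. For
$x \in \operatorname{argmin} \tilde f$, we have

$$\frac{\nabla \hat f(x + h) - \nabla \hat f(x)}{\|h\|} = \frac{g'(\tilde f(x + h))\,\nabla \tilde f(x + h)}{\|h\|} = \frac{g'(\tilde f(x) + o(\|h\|))\,o(1)}{\|h\|} = \frac{o(\|h\|)}{\|h\|} = o(1).$$

Thus $Q_{\hat f}(x) = 0$. On the other hand

$$\begin{aligned}
\|Q_{\hat f}(x + h)\| &\le g'(\tilde f(x + h))\,\|Q_{\tilde f}(x + h)\| + g''(\tilde f(x + h))\,\|\nabla \tilde f(x + h)\|^2 \\
&\le h(\tilde f(x + h))\,\|Q_{\tilde f}(x + h)\| + o(1) \\
&\le (\tilde f(x + h) - \tilde f(x)) + o(1) = o(1).
\end{aligned}$$

Thus $Q_{\hat f}$ is continuous at $x$ and thus $\hat f$ is $C^2$.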

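Conclusion. Let us prove finally that $\hat f$ does not satisfy the KL-inequality. Towards a contradiction,
let us assume that there exist $R > \inf \hat f = \min \hat f$ and a continuous function $\varphi : [\min \hat f, R) \to \mathbb{R}_+$
which satisfies $\varphi(\min \hat f) = 0$, $\varphi$ is $C^1$ on $(\min \hat f, R)$ with $\varphi' > 0$, such that we have

$$\|\nabla(\varphi \circ \hat f)(x)\| \ge 1, \quad \forall x \in [\min \hat f < \hat f < R].$$

Applying Theorem 20 [(i)⇔(vi)], we obtain

$$\operatorname{Dist}\big([\hat f \le g(\lambda_k)], [\hat f \le g(\lambda_{k+1})]\big) \le \varphi(g(\lambda_k)) - \varphi(g(\lambda_{k+1}))$$

and, as a consequence,

$$\sum_{k=0}^{+\infty} \operatorname{Dist}\big([\tilde f \le \lambda_k], [\tilde f \le \lambda_{k+1}]\big) = \sum_{k=0}^{+\infty} \operatorname{Dist}\big([\hat f \le g(\lambda_k)], [\hat f \le g(\lambda_{k+1})]\big) \le \varphi(g(\lambda_0)).$$

This contradicts the fact that $\sum \operatorname{Dist}(T_k, T_{k+1}) = +\infty$. □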

4.4 Asymptotic equivalence for discrete and continuous dynamics 

In this part we assume that $f : H \to \mathbb{R}$ is a $C^{1,1}$ convex function, that is, continuously differentiable
with gradient $\nabla f$ Lipschitz continuous. Let $L$ be a Lipschitz constant of $\nabla f$.
Fix $\beta > 0$ and $x \in \mathbb{R}^n$ and consider any sequence $\{Y_x^k\}$ satisfying

$$\beta\,\|\nabla f(Y_x^k)\|\,\|Y_x^{k+1} - Y_x^k\| \le f(Y_x^k) - f(Y_x^{k+1}), \quad k = 1, 2, \dots, \qquad Y_x^1 = x. \tag{51}$$

This condition has been considered in [1] for nonconvex functions defined in finite-dimensional
spaces. It is easily seen that a sequence satisfying (51) is a descent sequence, that is, $f(Y_x^k) \ge f(Y_x^{k+1})$, which implies
in particular that $\{f(Y_x^k)\}$ converges as $k$ goes to infinity.

Condition (51) is fulfilled by several explicit gradient-like methods, including trust region
methods, line-search gradient methods and some Riemannian variants; see [1] for examples and
references.

The following theorem establishes connections between length boundedness properties of
continuous gradient methods and length boundedness of discrete gradient iterations.
Theorem 38 (discrete vs continuous). Let $f$ be a $C^{1,1}$ convex function with compact sublevel
sets such that $\min f = 0$. Let us denote by $L$ a Lipschitz constant of $\nabla f$. Then the following
statements are equivalent:

(i) [Kurdyka-Lojasiewicz inequality] There exist $r_0 > 0$ and $\varphi \in \mathcal{K}(0, r_0)$ such that

$$\|\nabla(\varphi \circ f)(x)\| \ge 1, \quad \text{for all } x \in [0 < f \le r_0]. \tag{52}$$

(ii) [Length boundedness of piecewise gradient iterates] For all $\beta > 0$ and all $R > 0$,
there exists $C(\beta) > 0$ such that for any sequence of gradient iterates of the form

$$Y_{x_0}^0, Y_{x_0}^1, \dots, Y_{x_0}^{k_0},\ Y_{x_1}^0, \dots, Y_{x_1}^{k_1},\ \dots$$

with $f(x_0) \le R$, $f(Y_{x_{i+1}}^0) = f(x_{i+1}) \le f(Y_{x_i}^{k_i})$ and $\{Y_{x_i}^j : j = 0, \dots, k_i\}$ satisfying (51) for all
$i \in \mathbb{N}$ we have

$$\sum_{i=0}^{+\infty} \sum_{j=0}^{k_i - 1} \|Y_{x_i}^{j+1} - Y_{x_i}^j\| \le C(\beta).$$

(iii) [Length boundedness of piecewise gradient curves] For every $R > 0$ there exists
$C > 0$ such that

$$\operatorname{length}(\gamma) \le C,$$

for all piecewise subgradient curves $\gamma : [0, +\infty) \to H$ with $f(\gamma(0)) \le R$.

Proof. Let us first prove that (i)⇒(ii). By Theorem 29 [(i)⇒(ii)] (subgradient inequality,
convex case) we may assume that $\varphi$ is concave, defined on $(0, +\infty)$ and (52) holds for all
$x \in [0 < f]$. We now proceed in the spirit of [1]. Let $\beta > 0$, $x \in [0 < f]$ and let $Y_x^0, \dots, Y_x^k$ be
a (finite) sequence of gradient-type iterations that satisfies (51). For simplicity we set $Y_x^j = Y^j$
for all $j \in \{0, \dots, k\}$, so that

$$f(Y^j) - f(Y^{j+1}) \ge \beta\,\|\nabla f(Y^j)\|\,\|Y^{j+1} - Y^j\|.$$

Multiplying both parts with $\varphi'(f(Y^j))$ and applying (i) we get

$$\varphi'(f(Y^j))\,[f(Y^j) - f(Y^{j+1})] \ge \beta\,\|Y^{j+1} - Y^j\|.$$

Since $\varphi$ is concave we have

$$\varphi(f(Y^{j+1})) \le \varphi(f(Y^j)) + \varphi'(f(Y^j))\,[f(Y^{j+1}) - f(Y^j)]$$

and therefore

$$\varphi(f(Y^j)) - \varphi(f(Y^{j+1})) \ge \beta\,\|Y^{j+1} - Y^j\|.$$

Adding the above inequalities for $j = 0, \dots, k - 1$ we obtain

$$\varphi(f(Y^0)) - \varphi(f(Y^k)) \ge \beta \sum_{j=0}^{k-1} \|Y^{j+1} - Y^j\|. \tag{53}$$

Let us now consider a sequence of the form $\{Y_{x_0}^0, Y_{x_0}^1, \dots, Y_{x_0}^{k_0}, Y_{x_1}^0, \dots, Y_{x_1}^{k_1}, \dots\}$ as in (ii). Then
applying (53) to each subsequence $\{Y_{x_i}^j,\ j = 0, \dots, k_i\}$ we deduce

$$\sum_{i=0}^{+\infty} \sum_{j=0}^{k_i - 1} \|Y_{x_i}^{j+1} - Y_{x_i}^j\| \le \frac{1}{\beta}\,\varphi(f(Y_{x_0}^0)) \le \frac{1}{\beta}\,\varphi(R),$$

which proves the assertion.
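
The length estimate (53) can be illustrated on a simple example. The sketch below is not taken from the paper: it assumes a diagonal quadratic $f(x) = \frac12 \sum_i a_i x_i^2$ with $a_i > 0$, for which $\varphi(s) = \sqrt{2s/\min_i a_i}$ is a concave desingularizing function, and constant-step gradient descent with step $t \le 1/L$, for which the descent lemma gives condition (51) with $\beta = 1 - Lt/2$. The function name `grad_descent_length` and all parameter names are hypothetical.

```python
import math


def grad_descent_length(a, x0, step, n_iter):
    """Gradient descent on f(x) = 0.5 * sum(a_i * x_i^2), all a_i > 0.

    Returns (length, bound, ok) where
      length = sum of ||Y^{j+1} - Y^j|| over the iterations,
      bound  = phi(f(x0)) / beta with phi(s) = sqrt(2 s / min(a)),
      ok     = True if condition (51) held (up to rounding) at every step.
    """
    lipschitz = max(a)                    # Lipschitz constant of grad f
    lam_min = min(a)
    beta = 1.0 - 0.5 * lipschitz * step   # valid when step < 2 / lipschitz

    def f(x):
        return 0.5 * sum(ai * xi * xi for ai, xi in zip(a, x))

    def grad(x):
        return [ai * xi for ai, xi in zip(a, x)]

    def phi(s):
        return math.sqrt(2.0 * s / lam_min)

    x = list(x0)
    length = 0.0
    ok = True
    for _ in range(n_iter):
        g = grad(x)
        y = [xi - step * gi for xi, gi in zip(x, g)]
        move = math.dist(x, y)
        gnorm = math.hypot(*g)
        # condition (51): beta * ||grad f(Y^j)|| * ||Y^{j+1} - Y^j|| <= f(Y^j) - f(Y^{j+1})
        ok = ok and (beta * gnorm * move <= f(x) - f(y) + 1e-12)
        length += move
        x = y
    return length, phi(f(x0)) / beta, ok


if __name__ == "__main__":
    print(grad_descent_length([1.0, 10.0], [3.0, 1.0], 0.1, 200))
```

The returned length stays below the bound $\varphi(f(Y^0))/\beta$ regardless of the number of iterations, which is the content of (53) in this special case.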

The equivalence (i)⇔(iii) follows from Theorem 18 and Theorem 29. To complete the proof
it suffices to establish that (ii) implies the assertion (iv) of Theorem 18 (valley selection of finite
length) (in fact we prove (iv') with $R = 2$). So let us assume that (ii) holds and let $r_0 > 0$. We
aim to construct a piecewise absolutely continuous curve $\theta : (0, r_0] \to \mathbb{R}^n$ of finite length that
satisfies

$$\theta(r) \in V_2(r) := \Big\{ x \in [f = r] : \|\nabla f(x)\| \le 2 \inf_{y \in [f = r]} \|\nabla f(y)\| \Big\}, \quad \forall r \in (0, r_0].$$

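We shall use the explicit gradient method described in Subsection 5.2. Let $x_0 \in V_2(r_0)$ be such
that

$$\|\nabla f(x_0)\| \le \frac{3}{2} \inf_{y \in [f = r_0]} \|\nabla f(y)\|,$$

and consider the $C^1$ curve

$$t \mapsto x_0(t) := x_0 - t\,\nabla f(x_0).$$

Set $t_0 = \sup A_0$ where

$$A_0 := \Big\{ t \in [0, (3L)^{-1}] : f \circ x_0 \text{ strictly decreasing on } [0, t],\ x_0(\tau) \in V_2(f(x_0(\tau))) \text{ for } \tau \in [0, t] \Big\}.$$

Clearly $A_0$ is nonempty and $0 < t_0 \le (3L)^{-1}$. Set $r_1 = f(x_0(t_0)) < r_0$ and take $x_1 \in V_2(r_1)$
such that

$$\|\nabla f(x_1)\| \le \frac{3}{2} \inf_{y \in [f = r_1]} \|\nabla f(y)\|.$$

Proceeding by induction we obtain a sequence $\{(t_k, r_k, x_k)\}$ where $\{r_k\} \subset [0, r_0]$ is strictly
decreasing, $x_n(t) := x_n - t\,\nabla f(x_n)$ with $f(x_n) = r_n$ and

$$\|\nabla f(x_n)\| \le \frac{3}{2} \inf_{y \in [f = r_n]} \|\nabla f(y)\|.$$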

Let us denote by $r_\infty$ the limit of $\{r_k\}$ and let us assume, towards a contradiction, that $r_\infty > 0$.
Set

$$s(r) := \inf_{x \in [f = r]} \|\partial f(x)\|_- \quad \text{and} \quad s_\infty = \liminf s(r_n) = \lim s(r_n)$$

(note that convexity of $f$ guarantees that $s(r_1) \le s(r_2)$ whenever $r_1 \le r_2$) and observe that
$r_\infty > 0$ implies that $s_\infty > 0$ (use the compactness of the sublevel set $[f \le r_0]$). Let $n_0 \in \mathbb{N}$ be
such that $s(r_n) \le \frac{5}{4} s_\infty$ for all $n \ge n_0$. For $n \ge n_0$ and $t \in [0, t_n)$, Proposition 48 (Annex) yields

$$\|\nabla f(x_n(t))\| \le (Lt + 1)\,\|\nabla f(x_n)\|,$$

which implies

$$\|\nabla f(x_n(t))\| \le (Lt + 1)\,\|\nabla f(x_n)\| \le \frac{3}{2}(Lt + 1)\,s(r_n) \le \frac{15}{8}(Lt + 1)\,s_\infty.$$

A sufficient condition to have $x_n(t) \in V_2(f(x_n(t)))$ is therefore

$$\frac{15}{8}(Lt + 1)\,s_\infty \le 2 s_\infty, \quad \text{which holds whenever } 0 \le t \le (15L)^{-1}. \tag{54}$$
Similarly we can estimate the rate of decrease of $f(x_n(t))$. Since

$$\frac{d}{dt} f(x_n(t)) = -\langle \nabla f(x_n), \nabla f(x_n(t))\rangle,$$

the condition $\frac{d}{dt} f(x_n(t)) < 0$ is satisfied whenever

$$\|\nabla f(x_n)\|^2 > \|\nabla f(x_n)\|\,\|\nabla f(x_n(t)) - \nabla f(x_n)\|.$$

But since $\nabla f$ is Lipschitz continuous, $\|\nabla f(x_n(t)) - \nabla f(x_n)\| \le Lt\,\|\nabla f(x_n)\|$. Thus the condition
is satisfied if

$$\|\nabla f(x_n)\|^2 > Lt\,\|\nabla f(x_n)\|^2.$$

This last inequality is equivalent to $t < L^{-1}$, which implies in particular that for all $n \in \mathbb{N}$ such
that $s(r_n) \le \frac{5}{4} s_\infty$, we have

$$t_n \ge \frac{1}{15L}.$$

In this case Proposition 08](Annex) yields 

f(x n (tn)) < f(x n ) + (^-t n )\\Vf(x n )\\ 2 <r n + ^(^-t n )s(r n ) 2 <r n + ^^-t n )(^ Soc f. 
Thus in order to have f(x n (t)) < r m , it suffices to require 

t _L% _6£ fr n - roo 2 
2 225 

Using the fact that $(3L)^{-1} \ge t_n \ge (15L)^{-1}$, we see that

$$t_n - \frac{L t_n^2}{2} \ge (15L)^{-1} - (18L)^{-1} = (90L)^{-1}.$$

Since $(r_n - r_\infty)/s_\infty^2$ tends to zero, we have that $f(x_n(t_n)) < r_\infty$ for $n$ sufficiently large, which
is a contradiction.

We thus conclude that $\{r_n\} \to r_\infty = 0$ and $(0, r_0] = \bigcup_n (r_{n+1}, r_n]$. We define $\theta : (0, r_0] \to H$
as follows: $\theta(r) := x_n([f \circ x_n]^{-1}(r))$ whenever $r \in (r_{n+1}, r_n]$. Clearly $\theta$ defines a piecewise
absolutely continuous curve. To see that $\theta$ has finite length it suffices to observe that the
sequence $\{x_n\}_n$ is a sequence of gradient iterates that satisfies (51). Using Remark 49 and the
fact that the step-sizes in the construction of the $x_n$'s do not exceed $(3L)^{-1}$ we infer that

$$\frac{5}{6}\,\|x_n(t_n) - x_n\|\,\|\nabla f(x_n)\| \le f(x_n) - f(x_{n+1}).$$

Hence the curve $\theta$ has a finite length. This completes the proof. □



Remark 39. The assumption that $f$ is convex has been used to apply Theorem 29 (cf. concavity
of $\varphi$, which seems to be crucial for the proof of implication (i)⇒(ii)) and to assert that $f(Y_x^k) \to
\inf f$. These are the reasons for which Theorem 38 is not stated for general semiconvex functions
(in a local version). It would therefore be interesting to figure out under which type of conditions
(other than convexity or o-minimality of $f$) the function $\varphi$ of (52) can be taken concave.
5 Annex

In this Annex section we give several technical results which are needed in the text.
5.1 Technical results

Proposition 40 (closed graph of the subdifferential). Let $f : H \to \mathbb{R} \cup \{+\infty\}$ be a lower
semicontinuous semiconvex function. Let $\{x_k\}$ and $\{p_k\}$ be two sequences in $H$ such that $p_k \in
\partial f(x_k)$, $x_k$ converges strongly to $x$ and $p_k$ converges weakly to $p$. Then as $k \to +\infty$ we obtain

$$f(x_k) \to f(x), \qquad p \in \partial f(x).$$

Proof. This is a standard property. For a proof (in the more general setting of primal-lower-nice
functions) we refer the reader to [34]. □



Proposition 41 (slope functions and semicontinuity). Let $f : H \to \mathbb{R} \cup \{+\infty\}$ be a lower
semicontinuous semiconvex function.

(i) The extended-real-valued function

$$H \ni x \mapsto \|\partial f(x)\|_- := \inf_{p \in \partial f(x)} \|p\| \quad \text{(slope at } x\text{)}$$

is lower semicontinuous.

(ii) Take $r_0 \in \mathbb{R}$ and let $D$ be a nonempty compact subset of $[f \le r_0]$. Then the function

$$(-\infty, r_0] \ni r \mapsto s_D(r) := \inf_{x \in [f = r] \cap D} \|\partial f(x)\|_- \quad \text{(minimal slope of the } r \text{ level-line)}$$

is lower semicontinuous.

(iii) Assume that (23) and (24) hold for some $\bar r, \varepsilon > 0$. If $0 < r_1 < r_2 < \bar r$, then there exists
$\eta_{r_1, r_2} > 0$ such that

$$\inf_{x \in [r_1 \le f \le r_2] \cap B(\bar x, \varepsilon)} \|\partial f(x)\|_- \ge \eta_{r_1, r_2} > 0.$$

Proof. (ii) Take $r \in (-\infty, r_0]$ and let $\{r_k\} \subset (-\infty, r_0]$ be a sequence such that $r_k \to r$
and $\liminf_k s_D(r_k) < +\infty$. Fix $\eta > 0$ and let $(x_k, p_k) \in \operatorname{graph} \partial f$ be such that $x_k \in D$, $f(x_k) = r_k$,
$p_k \in \partial f(x_k)$ and $\|p_k\| \le s_D(r_k) + \eta$. Using a standard compactness argument together with
the fact that $\liminf_k s_D(r_k) < +\infty$ we can assume, with no loss of generality, that $x_k$ converges
(strongly) to $x \in D$ and that $p_k$ converges weakly to $p$. Using Proposition 40, we obtain that
$(x, p) \in \operatorname{graph} \partial f$ and $f(x) = r$. The conclusion follows from the (weak) lower semicontinuity of
the norm. Indeed

$$\liminf s_D(r_k) + \eta \ge \liminf \|p_k\| \ge \|p\| \ge s_D(r),$$

and $\eta > 0$ is arbitrary. The proofs of (i) and (iii) involve similar arguments. □

Lemma 42 (strong slope). Let $f$ be a proper lower semicontinuous semiconvex function. Then
for all $x$ in $H$

$$\|\partial f(x)\|_- = |\nabla f|(x).$$

Proof. Let $x \in H$ and $p = \partial^0 f(x)$ the projection of 0 on $\partial f(x)$. By (18), for any $y \in H$, we
have

$$\frac{(f(x) - f(y))^+}{\|y - x\|} \le \Big(-\Big\langle p, \frac{y - x}{\|y - x\|}\Big\rangle + \alpha\|y - x\|^2\Big)^+ \le \big(\|p\| + \alpha\|y - x\|^2\big)^+.$$

By taking the limsup as $y \to x$, we get $|\nabla f|(x) \le \|p\| = \|\partial f(x)\|_-$. To prove the opposite
inequality, we consider the subgradient trajectory $\chi_x$. If $x$ is a critical point of $f$, then $0 =
\|\partial f(x)\|_- \ge |\nabla f|(x)$. Otherwise, $\chi_x(t) \ne x$ for all $t > 0$. By Theorem 13 (iv), we have



\\x-Xx(t)\\ ~ \\x-Xx(t)\\ J 

Taking the limsup as $t \downarrow 0$ and using the continuity of the semiflow and Theorem 13 (ii), (iii) we
obtain the desired result. □

Lemma 43 (chain rules). Let $f : H \to \mathbb{R} \cup \{+\infty\}$ be an extended-real-valued function.

(i) Let $\varphi : (0, 1) \to \mathbb{R}$ be a $C^1$ function. Then

$$\partial(\varphi \circ f)(x) = \varphi'(f(x))\,\partial f(x), \quad \text{for all } x \in [0 < f < 1].$$

(ii) Let $\gamma : (0, 1) \to H$ be a $C^1$ curve. For all $t \in (0, 1)$, we have

$$\partial(f \circ \gamma)(t) \supset \{\langle \dot\gamma(t), p(t)\rangle : p(t) \in \partial f(\gamma(t))\}.$$

Proof. For the proof see [42] for example. □

Lemma 44 (continuous integrable majorant). Let $u : (0, r_0] \to \mathbb{R}_+$ be an upper semicontinuous
function such that $u \in L^1(0, r_0)$. Then there exists a continuous function $w : (0, r_0] \to \mathbb{R}_+$ such
that $w \ge u$ and $w \in L^1(0, r_0)$. If moreover $u$ is assumed to be nonincreasing, $w$ can be chosen
to be decreasing.

Proof. With no loss of generality we assume $r_0 = 1$. Replacing if necessary $u(\cdot)$ by the function
$u(\cdot) + 1$ we may also assume that $u \ge 1$. Let $a_k > 0$ be a strictly decreasing sequence such that
$a_0 = 1$ and $(0, 1] = \bigcup_{k \in \mathbb{N}} [a_{k+1}, a_k]$. Let us assume that there exists a sequence of continuous
functions $w_k : [a_{k+1}, a_k] \to \mathbb{R}$ such that $w_k \ge u$ on $[a_{k+1}, a_k]$ and $\int_{a_{k+1}}^{a_k} w_k \le \int_{a_{k+1}}^{a_k} u + \frac{1}{(k+1)^2}$.
To establish the existence of $w$, we proceed by induction on $k$. Fix $k \ge 1$ and assume that $w$ is
defined on $[a_k, 1]$ with $w \ge u$, $w$ continuous and

/ ^ / u +E-r 

There is no loss of generality to assume w k (a k ) < w(a k ) (the case w k (a k ) > w(a k ) can be treated 
analogously). Let us define 

n Wk(ak){ak ~ a k+ i) 

U < ( k = t, , 7Tn — t — ; < a k - a k+ ±, 

(k + ly w(a k ) max [afe+liafc] w k 

and let us consider the functions 

\ r in w ( Q fc) i 

w fc (a fc ) 
defined by



. , . 1 /. , . . w(a k ) 

Afc(r) = — \(a k - r) + (r - (a k - e k ))- 



£fc V w k (a k ) 
The function u> can be now extended to [a/%+1, 1] by setting 



w(r) 



w k (r), if r £ [a k+ i,a k - e k ) 

h(r)w k (r), if r G [a k - e k , a k ] 
^ -w(r), if r G (a k , 1]. 



It is easily seen that the function w is continuous (by definition of X k ), it satisfies w > u on 
[<2fc + i,afc] (thus on (a k+ i, 1]) and moreover 



/"i ra>k-£k ro-k /■! 

/ w = / ^fc + / + / 

Ja k+ i J a k+ i Ja k —e k J a k 

< 



W 



1 «,(a fc ) f 1 A 2 



(fc + l) 2 iu fc (a*.) [ofc+LOfc] 7 afc 



This proves the existence of a continuous function $w$ that satisfies the required properties.

To complete the proof it suffices to prove the existence of such a sequence $\{w_k\}$. To this
end, fix $k \in \mathbb{N}^*$ and set

$$u_\varepsilon(r) = \sup_{\rho \in [a_{k+1}, a_k]} \Big\{ u(\rho) - \frac{|\rho - r|^2}{2\varepsilon} \Big\}.$$

It is easily seen that $u_\varepsilon$ is continuous, $u(r) \le u_\varepsilon(r) \le \max_{\rho \in [a_{k+1}, a_k]} u := M_k < +\infty$ and
$\lim_{\varepsilon \to 0} u_\varepsilon(r) = u(r)$ for all $r \in [a_{k+1}, a_k]$ (see [42], for example). Note that the upper semicontinuity
of $u$ on the compact set $[a_{k+1}, a_k]$ guarantees that $M_k$ is finite. Applying the Lebesgue
dominated convergence theorem we conclude that $u_\varepsilon$ converges to $u$ in the norm topology of
$L^1(a_{k+1}, a_k)$. Thus there exists $\varepsilon_0 > 0$ such that

$$\int_{[a_{k+1}, a_k]} u_{\varepsilon_0} \le \int_{[a_{k+1}, a_k]} u + \frac{1}{(k+1)^2}.$$

Thus the function $w_k := u_{\varepsilon_0}$ satisfies the requirements stated above. This completes the proof
of the first part of the statement. The case where $u$ is assumed decreasing can be treated with
similar (and occasionally simpler) arguments. □

Lemma 45. Let $h \in C^0((0, r_0], \mathbb{R}_+)$ be an increasing function. Then there exists a function
$\psi \in C^\infty(\mathbb{R}, \mathbb{R}_+)$ such that $\psi = 0$ on $\mathbb{R}_-$, $0 < \psi(s) \le h(s)$ for all $s \in (0, r_0)$, and $\psi$ is increasing
on $(0, r_0)$.

Proof. Let us extend the definition of $h$ by 0 on $\mathbb{R}_-$ and $h(r_0)$ for $s > r_0$. Consider $\phi \in
C^\infty(\mathbb{R}, \mathbb{R}_+)$ with $[0, 1]$ as support and $\int_{\mathbb{R}} \phi = 1$. Then we define $\psi$ by $\psi = \phi * h$; i.e. $\psi(s) =
\int_{\mathbb{R}} \phi(t)\,h(s - t)\,dt$. It is then straightforward to verify that $\psi$ satisfies the expected properties. □
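
The convolution in the proof above can be approximated numerically to see the properties $\psi = 0$ on $\mathbb{R}_-$ and $0 < \psi \le h$ at work. The following is a minimal sketch, not the authors' construction in detail: it assumes the particular bump $\phi(t) \propto \exp(-1/(t(1-t)))$ on $(0, 1)$ (any smooth nonnegative kernel supported on $[0,1]$ with unit integral would do), and replaces the integral by a midpoint rule whose weights are normalized to sum to one. The name `make_psi` and its parameters are hypothetical.

```python
import math


def make_psi(h, r0, n_nodes=2000):
    """Return a discretized version of psi = phi * h (Lemma 45).

    h  : increasing nonnegative function on (0, r0]
    r0 : right endpoint; h is extended by 0 on (-inf, 0] and by h(r0) beyond r0
    phi is the bump exp(-1/(t(1-t))) on (0, 1), normalized numerically.
    """
    nodes = [(i + 0.5) / n_nodes for i in range(n_nodes)]
    raw = [math.exp(-1.0 / (t * (1.0 - t))) for t in nodes]
    total = sum(raw)
    weights = [w / total for w in raw]   # discrete analogue of "integral of phi = 1"

    def h_ext(s):
        if s <= 0.0:
            return 0.0
        return h(min(s, r0))

    def psi(s):
        # midpoint-rule approximation of  integral phi(t) h(s - t) dt
        return sum(w * h_ext(s - t) for w, t in zip(weights, nodes))

    return psi


if __name__ == "__main__":
    psi = make_psi(lambda s: s * s, 1.0)
    for s in (-1.0, 0.0, 0.25, 0.5, 0.75):
        print(s, psi(s))
```

Because the kernel is supported on $[0, 1]$ and $h$ is increasing, each value $\psi(s)$ is an average of values $h(s - t) \le h(s)$, which is why the bound $\psi \le h$ holds; the discretization only approximates the smooth function of the lemma.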
Proposition 46 (Piecewise absolutely continuous selections). Let $r_0 > 0$ and $V : (0, r_0] \rightrightarrows H$
be a set-valued mapping with nonempty values. Assume that for each $r \in (0, r_0]$ there exist
$\varepsilon_r \in (0, r)$ and an absolutely continuous curve $\theta_r : (r - \varepsilon_r, r] \to H$ such that

$$\theta_r(s) \in V(s) \text{ for all } s \text{ in } (r - \varepsilon_r, r].$$

Then there exist a countable partition $\{I_n\}_{n \in \mathbb{N}}$ of $(0, r_0]$ into intervals $I_n$ of nonempty interior
and a selection $\theta : (0, r_0] \to \mathbb{R}^n$ of $V$ such that $\theta$ is absolutely continuous on each $I_n$.

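Proof. Let $\Omega$ be the set of couples $(\alpha : I_\alpha \subset (0, r_0] \to \mathbb{R}^n,\ \{I_{\alpha,j}\}_{j \in J_\alpha})$ where $\{I_{\alpha,j}\}_{j \in J_\alpha}$ is a
countable partition of $I_\alpha$ into (disjoint) intervals $I_{\alpha,j}$, $j \in J_\alpha$, with nonempty interior such that:

(a) for each $j \in J_\alpha$, $\alpha$ is absolutely continuous on $I_{\alpha,j}$,

(b) for each $r \in I_\alpha$, $\alpha(r) \in V(r)$.

We define a partial order $\preceq$ on $\Omega$ by

$$\alpha_1 \preceq \alpha_2 \iff \forall j \in J_{\alpha_1},\ \exists k \in J_{\alpha_2},\ I_{\alpha_1,j} \subset I_{\alpha_2,k} \text{ and } \alpha_1(r) = \alpha_2(r) \text{ for all } r \in I_{\alpha_1}.$$

Note that $(\Omega, \preceq)$ is a nonempty partially ordered set. Let us check that each totally ordered subset
of $\Omega$ has an upper bound in $\Omega$. To this end, let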
of f2 has an upper bound in Q. To this end, let 

w = {(ai,{ia,j}jeJa,)}i6r 
be a totally ordered subset of f2. For each r G Ui e cl ai define a(r) by 

a(r) := a;(r), 

whenever r G //, and set J Q = Uz 6 z,/ Qr Since cj is totally ordered, the mapping a : Iq, — > M n 
is well defined and (b) is clearly satisfied. For / G L and j G J/, set := J ai , I ai j = Iij and 
D : = {(m, k) : m £ L, k G J m .}- For each (/, j) G D, let us define 

Mij:= |J J m , fe . (55) 

{m,k)&D, h^dlm.k 

Observe that J Q = UpjjgoMjj and that each My is an interval with nonempty interior. 

Let us prove that for all (l,j), G D, we have either Myji = Mij or Mj/j/ n My = 0. 

In order to establish this result, let us beforehand show that for all (l',f) in D such that 

h,j n Iy j/ 7^ 0, we have Mi j = Myj>. Indeed, since uj is totally ordered, we have for instance 
lyji C Iij and so My C Myj/. Conversely, take (m,k) G D such that I mi fc D izy. Since 
^m,fc H iy 7^ 0, we have either J mi jt C iy or D iy, in any case we see (c/. definition ([551) ) 
that I m; fc C My and thus My ji C My. 

If MijCiMy ji 7^ 0, take r in the intersection, and observe that by definition there exist (m, k) 
and (m', fe') in D such that I mj fc D iy with r G / m> fe and / m ',jfc' D Iyj' with r G I m ',k'- Using the 
previous remark, we obtain that M m ^ = Mi j and M m > # = Myj/. But since I m ,k H / m ' 7^ 0, 
we also have M m ^ = M m i # and thus Mi a = My j/. 

Let us define an equivalence relation $\sim$ on $D$ by 

\[ (l,j) \sim (l',j') \iff M_{l,j} = M_{l',j'}. \]

This equivalence relation defines a partition of $D$ into equivalence classes. By the axiom of choice 
we can pick one and only one element in each equivalence class and this defines a nonempty 
subset $D'$ of $D$. By construction we have $I_\alpha = \bigcup_{(l,j) \in D'} M_{l,j}$ and $M_{l,j} \cap M_{l',j'} = \emptyset$ for each 
$(l,j) \neq (l',j')$ in $D'$. Besides, since each $M_{l,j}$ (for $(l,j) \in D'$) has a nonempty interior, we see 
that $D'$ is a countable set. This shows that $(\alpha, \{M_{l,j},\ (l,j) \in D'\})$ is in $\Omega$ with in addition 

\[ \alpha \succcurlyeq \alpha_l \quad \text{for all } l \in L. \]

Applying Zorn's lemma to $\Omega$, we obtain the existence of a maximal element $(\theta : I_\theta \to 
\mathbb{R}^n, \{I_{\theta,i},\ i \in J_\theta\})$. Arguing by contradiction, we see immediately that $I_\theta = (0, r_0]$. □ 

5.2 Explicit gradient method 

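We recall the following useful result. 

Lemma 47 (Descent lemma). Let $f$ be a $C^{1,1}$ function (that is, $\nabla f$ is $L$-Lipschitz continuous). 
Then 

\[ f(y) \le f(x) + \langle \nabla f(x), y - x \rangle + \frac{L}{2}\|y - x\|^2. \]

Proof. Set $x(t) = x + t(y - x)$ and notice that 

\[ f(y) - f(x) = \int_0^1 \frac{d}{dt} f(x(t))\, dt = \langle \nabla f(x), y - x \rangle + \int_0^1 \langle \nabla f(x(t)) - \nabla f(x), y - x \rangle\, dt. \]

By the Cauchy-Schwarz inequality and the Lipschitz continuity of $\nabla f$, the last integrand is bounded by $Lt\|y - x\|^2$, whose integral over $[0, 1]$ equals $\frac{L}{2}\|y - x\|^2$. The assertion follows. □ 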
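
As a quick sanity check, the descent inequality can be tested numerically on a quadratic function. The sketch below is an illustration only: the random matrix `Q`, the dimension 5 and the helper `descent_gap` are hypothetical choices, not taken from the paper. For $f(x) = \frac12 \langle Qx, x\rangle$ with $Q$ symmetric positive semidefinite, $\nabla f$ is Lipschitz continuous with constant equal to the largest eigenvalue of $Q$.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((5, 5))
Q = A.T @ A                       # symmetric positive semidefinite (assumed test matrix)
L = np.linalg.eigvalsh(Q).max()   # Lipschitz constant of grad f for this quadratic

def f(x):
    return 0.5 * x @ Q @ x

def grad_f(x):
    return Q @ x

def descent_gap(x, y):
    """Right-hand side minus left-hand side of the descent lemma; should be >= 0."""
    d = y - x
    return f(x) + grad_f(x) @ d + 0.5 * L * (d @ d) - f(y)

gaps = [descent_gap(rng.standard_normal(5), rng.standard_normal(5))
        for _ in range(1000)]
```

For a quadratic the gap equals $\frac12\bigl(L\|y-x\|^2 - \langle Q(y-x), y-x\rangle\bigr)$, so it is nonnegative up to rounding error.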

Given $x \in H$, let us consider the following recursion rule 

\[ x^+ := X(t, x) = x - t\nabla f(x), \quad t > 0. \tag{56} \]

Choosing a starting point $x^0$ in $H$, and $\lambda_k > 0$ a sequence of step sizes, the explicit gradient 
method reads 

\[ x^{k+1} = X(\lambda_k, x^k). \]
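
The recursion above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the function names `gradient_step` and `explicit_gradient`, the diagonal quadratic test function and the constant step size 0.2 are assumptions made for the example.

```python
import numpy as np

def gradient_step(t, x, grad_f):
    """One application of rule (56): X(t, x) = x - t * grad f(x)."""
    return x - t * grad_f(x)

def explicit_gradient(x0, grad_f, steps):
    """Iterate x^{k+1} = X(step_k, x^k) over the given sequence of step sizes."""
    xs = [np.asarray(x0, dtype=float)]
    for step in steps:
        xs.append(gradient_step(step, xs[-1], grad_f))
    return xs

# Hypothetical test problem: f(x) = 0.5 * sum(d_i * x_i^2), so grad f is 4-Lipschitz.
d = np.array([1.0, 4.0])
f = lambda x: 0.5 * np.sum(d * x**2)
grad_f = lambda x: d * x

xs = explicit_gradient([1.0, 1.0], grad_f, steps=[0.2] * 50)
values = [f(x) for x in xs]
```

With these assumed data the step size is below $2/L = 0.5$, so the values of $f$ along the iterates are nonincreasing and the iterates approach the minimizer $0$.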

A part of the convergence analysis of this method (and some of its variants) is based on the 
following elementary results. 

Proposition 48. Let $f$ be a $C^{1,1}$ function, $x \in H$, $t \in [0, 2L^{-1})$ and $x^+$ be given by (56). Then 
(i) $\left(1 - \frac{Lt}{2}\right) \|x^+ - x\|\, \|\nabla f(x)\| \le f(x) - f(x^+)$; 
(ii) $\|\nabla f(x^+)\| \le (Lt + 1)\|\nabla f(x)\|$. 

Proof. Assertion (i) follows directly from Lemma 47, while assertion (ii) is a consequence of the 
fact that $\nabla f$ is Lipschitz continuous on $[x, x^+]$ with constant $L$. □ 
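
For the reader's convenience, the two estimates can be spelled out as follows. This is a sketch of the omitted computation, using only the descent lemma (Lemma 47), the rule (56) and the Lipschitz continuity of the gradient.

```latex
% (i): apply Lemma 47 with y = x^+ = x - t\nabla f(x), so that \|x^+ - x\| = t\,\|\nabla f(x)\|.
\begin{align*}
f(x^+) &\le f(x) + \langle \nabla f(x), x^+ - x \rangle + \tfrac{L}{2}\,\|x^+ - x\|^2 \\
       &= f(x) - t\,\|\nabla f(x)\|^2 + \tfrac{L t^2}{2}\,\|\nabla f(x)\|^2 \\
       &= f(x) - \Bigl(1 - \tfrac{Lt}{2}\Bigr)\,\|x^+ - x\|\,\|\nabla f(x)\| .
\end{align*}
% (ii): triangle inequality, then L-Lipschitz continuity of \nabla f.
\begin{align*}
\|\nabla f(x^+)\| &\le \|\nabla f(x)\| + \|\nabla f(x^+) - \nabla f(x)\| \\
                  &\le \|\nabla f(x)\| + L\,\|x^+ - x\| = (1 + Lt)\,\|\nabla f(x)\| .
\end{align*}
```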

Remark 49. Condition (51) of Section 4.4 corresponds of course to the inequality (i) above. 